Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0803 |
Symbol | |
ID | 5669219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 937766 |
End bp | 941332 |
Gene Length | 3567 bp |
Protein Length | 1188 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239731 |
Product | transcription-repair coupling factor |
Protein accession | YP_001505167 |
Protein GI | 158312659 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0214899 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTCG CGCCGCTGCT CGACGCCCTA GTCGCCCGCC CCGGTGGTGA TCCAGCGCTG ACCCGCGCGC TGGGCTCGCC GGACGAACCC GTCCTCGACC TGGCCGGGCC CGCGGCGCTG CGCCCGTTCG CGGCGGCGGC GATGGCCCGG GCCGGTCACA CCGTCCTCGC GGTGACGGCG ACCGGCCGTG AGGCCGAGGA CCTGGCCGAC GCGGTGGGCA GCCTGCTGGG TTCCGAGCGC GTCGCGGTGT ACCCGAGCTG GGAGACGCTG CCCCACGAGC GGCTCTCGCC GCGGGCGGAC ACGGTGGGGC GCCGGCTGGC CGTCCTGCGC CGGCTCGCGC ACCCGGGCAC CGGCGGTGCC ACCGGTGCGG GCCCGCTGGC CGTCGTGGTG GCCCCGGTCC GCTCGGTGCT CCAGCCCCAG GTGGCCCGGC TGGGTGAGCT GGCCCCGGTC GCGCTGGCGA AGGGGGACAC CGCCGACCTC GAGGACGTCA CCGCCCGCCT GGTGGGCATC GCCTACCACC GCGTCGACCT GGTCGAGCGG CGCGGCGAGA TGGCCGTCCG CGGCGGGATC CTCGACGTCT TCCCGCCGAC CGAGGAACAC CCGCTGCGGA TCGAGTTCTT CGGCGACGAG GTGGACGACA TCCGCCGCTT CTCCGTCGCC GACCAGCGCG CGCTGCCCGC GGAGGACGGC GAGCCGGCGG CCGAGCTGTT CGCGCCGCCG TGCCGGGAGC TGCTGCTCAC CGACGAGGTC CGGGCCCGCG CGGCGGAGCT GGCGCGGGAG CACCCGCAGC TCGTCGACAT GCTTGACAAG ATCGCCGAGG GCATCCCGGT GGAGGGCATG GAGGCGCTCG CGCCCGTGCT CACCGACGAG ATGACCCTGC TGCTGGACGA GTTGCCGGCG GGCTCGCGCG TGCTGGTCTG CGATCCCGAG CGGGTGCGCA CCCGGGCCGG CGAGCTGGTC CGCACCAGCC AGGAGTTCCT GGACGCCTCG TGGAGCGTCG CGGCGCTCGG CGGCGGGGCG CCGATCGACC TGGGCGCGGC GACCTACCGT TCGGTCGCCG ACGTCCGGGA GCGGGCCGCC GAGCTCGGTA TCCCCTGGTG GTCCGTCAGC CCGTTCGCCT CCGAGCCGGG AGCCGGGGCC GAGGGTGAGG GTGGGGACAC CTTCGGTGAC GACACCGTCG TCGCCAGCCT GCGGCCGGCC ACGGCCTACC ACGGGGACAC CGCGCGGGCC ACCGCGGACA TCAAGGGCTG GCTCGCCGAC TCCTGGCGGG TGCTGCTCGT CACCGAGGGC CACGGCCCGG CCGAGCGGCT CGTCGAGATG ATGCGGGAGG CCGATCTCGG CGCGCGGCTG GCGGCCGACG CCGAGCTCGC CGCGGGGGTC GCCGTCGTCA CCCAGGCCCA GCTCGGCGCC GGGTTCGTCA GCCCCACGCT GCGCCTCGCC GTGCTCACCG AGACCGACCT CGCCGGCGCC CGGGGGGTCA CCACCCGGGA CATGCGCCGG ATGCCGAGCC GGCGGCGCAA GGGCATCGAC CCGCTCGCGC TGAGCGCCGG CGACCTCGTC GTGCACGACG CCCACGGTGT CGGCCGCTAC GTCGAGATGG TCACCCGCAC GGTGGCCGGC GCCAAGCGCG AGTACCTGCT GCTCGAGTAC GCCCGCGGCG ACCGCCTCTA CGTGCCGACC GACCAGCTCG AGCAGATCAG CCGCTACGTC GGCGGCGAGG GCCCGAGCCT GGACCGCATC GGCGGCGCCG ACTGGGGCAA GCGCAAGAGC CGGGCGCGCA AGGCCGTCAA GGAGATCGCC GGCGAGCTGA TCCGGCTGTA CAGCGCCCGG ATGGCCGCCC CCGGCCACGC CTTCGGCCCC GACAGCCCCT GGCAGCGTGA GCTGGAGGAC GCCTTCCCCT TCCGGGAGAC GCCCGACCAG CTCGCGGCCA TCGACGAGGT CAAGGCCGAC ATGGAGAAGC CCGTCCCGAT GGACAGGGTG ATCTGCGGCG ACGTCGGCTA CGGCAAGACC GAGATCGCCG TCCGGGCCGC GTTCAAGGCG GTCACGGACG GCCGCCAGGT CGCCGTCCTC GTCCCGACCA CGCTGCTCGT CCAGCAGCAC TTCCAGACCT TCGCCGAGCG GTACGCGCCG TTCCCGGTCA CGGTGAAGGC GGTGAGCCGG TTCAACGCGC CGTCCGAGCA GCGCGCCGTG CTCGACGGCC TCGCCAACGG CACGGTGGAC GTCGTCATCG GCACGCACCG TCTGCTCTCC AGCGAGACGA AGTTCTCCGA CCTGGGCCTG GTGATCGTCG ACGAGGAGCA GCGCTTCGGC GTCGAGCACA AGGAGCACCT CAAGAAGATG CGGACCGCGG TCGACGTGCT CACCATGAGC GCCACGCCGA TCCCGCGGAC GCTGGAGATG TCGATCACCG GCATCCGGGA GCTGTCGACG ATCGACACCC CGCCGGAGGA GCGCCATCCG GTGCTGACCT CGGTGGCGCC CTACGAACCG CGCCAGGTGA CCGCGGCCAT CCGGCGTGAG CTGCTGCGCG AGGGGCAGGT GTTCTTCATC CACAACCGGG TGGAGAGCAT CGACCGGGCC GCCGCCGCGC TGCGCGAGCT GGTGCCCGAG GCGCGGATCG CCACCGCGCA CGGCCAGATG AACGAGGACG CCCTCGAGCA GGTCATGGTC TCCTTCTGGG AGAAGAAGTT CGACGTCCTG GTCTGCACGA CGATCGTCGA GTCGGGCCTG GACATCTCGA ACGCGAACAC GCTGATCGTC GAGCGGGCCG ACAACTTCGG GCTCTCCCAG CTGCACCAAC TGCGCGGGCG GGTCGGCCGG GGCCGTGAGC GCGCCTACGC CTACTTCCTG TACCCGGCGG ACCGGCCGCT CTCGGAGACC GCGCACGACC GGCTGGCCAC CATCGCCCAG CACAACGACC TGGGCGCCGG CATGGCCGTC GCGATGAAGG ACCTGGAGAT CCGCGGTGCC GGGAACCTGC TCGGCGGCGA ACAGTCCGGC CACATCGCGT CGGTCGGTTT CGACCTGTAC GTCCGGATGG TCGGCGAGGC CGTCGCCGAG TACAAGGGCG AGGCCGAGGA GCCGCCGGAG GTCAAGGTCG AGCTGCCCGT CGACGCGAAC CTGCCGCACG ACTACGTGCC CAGCGAGCGG CTTCGGCTCG ACGCCTACCG CCGGCTGGCC GGCGCGGTCT CCGACGCCGA CATCGCCGAG GTCCGCGGCG AGCTGACCGA CCGGTTCGGC CTGCTGCCCG AGCCGGTGGA GAACCTGCTC GCCGTCGCCG GCCTGCGGGT GCTCGCCCGC CGGTTCGGCG TCACCGAGAT CACGGTGGCC GGCCGGCAGG TCCGGTTCGC CCCGCTGGAG CTGCGGGAGA GCCAGACCCT GCGGCTGACC AGGCTCTACA AGGGCGCGGT GGTCAAGCCG GCCATGCGCA CGGTGCTGGT GCCGGCGCCC ACCGAGACCG GCCGGATCGG CTCCCGGCCG CTGCGCGACC GTGCGCTGCT GGCCTGGGCC GGCCAGTTGC TGGAGGCCGT CGCCGGCGAC TCGGTGGCCG CCGCCGCAAG TCGGTGA
|
Protein sequence | MTLAPLLDAL VARPGGDPAL TRALGSPDEP VLDLAGPAAL RPFAAAAMAR AGHTVLAVTA TGREAEDLAD AVGSLLGSER VAVYPSWETL PHERLSPRAD TVGRRLAVLR RLAHPGTGGA TGAGPLAVVV APVRSVLQPQ VARLGELAPV ALAKGDTADL EDVTARLVGI AYHRVDLVER RGEMAVRGGI LDVFPPTEEH PLRIEFFGDE VDDIRRFSVA DQRALPAEDG EPAAELFAPP CRELLLTDEV RARAAELARE HPQLVDMLDK IAEGIPVEGM EALAPVLTDE MTLLLDELPA GSRVLVCDPE RVRTRAGELV RTSQEFLDAS WSVAALGGGA PIDLGAATYR SVADVRERAA ELGIPWWSVS PFASEPGAGA EGEGGDTFGD DTVVASLRPA TAYHGDTARA TADIKGWLAD SWRVLLVTEG HGPAERLVEM MREADLGARL AADAELAAGV AVVTQAQLGA GFVSPTLRLA VLTETDLAGA RGVTTRDMRR MPSRRRKGID PLALSAGDLV VHDAHGVGRY VEMVTRTVAG AKREYLLLEY ARGDRLYVPT DQLEQISRYV GGEGPSLDRI GGADWGKRKS RARKAVKEIA GELIRLYSAR MAAPGHAFGP DSPWQRELED AFPFRETPDQ LAAIDEVKAD MEKPVPMDRV ICGDVGYGKT EIAVRAAFKA VTDGRQVAVL VPTTLLVQQH FQTFAERYAP FPVTVKAVSR FNAPSEQRAV LDGLANGTVD VVIGTHRLLS SETKFSDLGL VIVDEEQRFG VEHKEHLKKM RTAVDVLTMS ATPIPRTLEM SITGIRELST IDTPPEERHP VLTSVAPYEP RQVTAAIRRE LLREGQVFFI HNRVESIDRA AAALRELVPE ARIATAHGQM NEDALEQVMV SFWEKKFDVL VCTTIVESGL DISNANTLIV ERADNFGLSQ LHQLRGRVGR GRERAYAYFL YPADRPLSET AHDRLATIAQ HNDLGAGMAV AMKDLEIRGA GNLLGGEQSG HIASVGFDLY VRMVGEAVAE YKGEAEEPPE VKVELPVDAN LPHDYVPSER LRLDAYRRLA GAVSDADIAE VRGELTDRFG LLPEPVENLL AVAGLRVLAR RFGVTEITVA GRQVRFAPLE LRESQTLRLT RLYKGAVVKP AMRTVLVPAP TETGRIGSRP LRDRALLAWA GQLLEAVAGD SVAAAASR
|
| |