Gene Information |
Locus tag | Tfu_0523 |
Symbol | |
ID | 3578989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | + |
Start bp | 589304 |
End bp | 592438 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637684211 |
Product | putative ATP-dependent DNA helicase |
Protein accession | YP_288584 |
Protein GI | 72160927 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0210] Superfamily I DNA and RNA helicases [COG2887] RecB family exonuclease |
TIGRFAM ID | |
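The coordinates, gene length, and protein length listed above agree with one another, and a short sketch can confirm it. It assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed 3135 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(589304, 592438)
print(length)                  # 3135
print(protein_length(length))  # 1044
```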
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGGACAACC CCCCGTACCG TCTTGTACGC CGCATCCGGC ACCGTGCGCC CGCACCCGAG CTCGACGACG ACCAGCGCCG GGTCGTCGAC CACGAAGGCG GCCCGCTGCT CGTGCTCGCC GGTCCAGGAA CCGGGAAGAC CACGACGATC GTCGAAGCGA TCGTCGACCG GGTCGAGAAC CGGGGCGTGG ACCCCGCGCA CGTGCTCGTC CTCACCTTCA GCCGCAAAGC GGCCCAAGAA CTCCGGGAAC GCATCACCGC CCGGCTGCGG CGCACCGTCC GCGAACCGCT CGCCCTCACC TTCCACAGCT ACGCCTACGC GCTGATCCGC CGCGAATTCC AGCGGGTCGG CGACCACGCG CCCCGCCTCC TGTCCGGCCC GGAGCAGCTG ATGGAAGTGC GGGAACTCCT CAAAGGCGAA CTCGCTGACG GCGCCCCCGG GTGGCCCGAA CAGCTCCGCC CCATGCTGCA CACCCGCGGC TTCGCCGAGG AGCTCCGCGA CTTCCTCATG CGGGTGCAAG AACGCGGACT CCACGCCGAC GACGTCCGGG AGCTCGGCCG CCGCCACGGG CGAGAAGACT GGGTGGCCGC CGGGGGATTC CTGGAACGCT ACACCGGCCG CTTCGACATC GCCCCCGTGC CCACCTTCAA CTACGCGGAA CTCGTGCGGG TCGCCGCCAA CCTGCTCAAC GACCCGGACA TCCAGGCGCG GGAGCGTGCC GCCCGCAAAG TCGTGTTCGT CGACGAATAC CAGGACACCG ACCCCGCCCA GGAGGAACTG CTTCACGCCC TGGCCGGAGA CGGCCGCGAC CTGGTCGTGG TCGGCGACCC CGACCAGTCC ATCTACGGTT TCCGGGGAGC TGAAGTCCGC AACATCCTCA ACTTCCCCGA CCGGTTCCGC ACCCCCACAG GAGACCCCGC GCCCGTGGTG GCGCTGCACA CCTGCCGCCG CAGCGGCGCG GAACTGCTGC GGGTCTCCCG GGCCCTCGCC CAGCGGCTGC CCGCCGTCCC CGGGCCCGGC GGCGAAGGGG TCAACGCGCA CCGCAACCTC GTGCCCGCCG AAGGGACACC CCCGGGACGG GCTCGAATGC TCCTCGCGGA AAGCCCGGTC CACGAAGCCG CTGTCATCGC GGACCTGCTG CGCCGCGCCC ACCTCATCGA CGGAGTGCCC TGGTCGCGCA TGGCAGTCCT TGTCCGCTCC GTGACCCGAC ACGTCCCCGT ACTGCGCCGC GCACTCATCG CCGCCGACGT CCCCGTGACC GTCGACGGCG ACGATCTGCC GCTGGCGTCC GAACCCCTTG TGCGGTCCAT GCTGCTGCTC CTCCGCTGCG CATTCCACCC GGAGCGGTTG GACGAGGAAG CCGCCCACAA CCTGTTGACG AGCGCGTTCG GAGAAGCCGA CGCGCTCGGA CTGCGGCGGC TCGGGCGCGC CTTGCGACAA CTGGAACTGG ACGCCGGCGG CCGCCGCCCC GCCGCGGCCC TGCTCGCTGA GATCCTCCAC GACCCCCGCG ACCTGGTGAT GGTCGACCCG GAGGTGCGGG CCCCTGCGGA GCGGATCGCC ACCCTGCTGC GCCTGGTCCG CGACAGCATC GCCGAAGGCG CCAACGCAGA AGAAGTGCTG TGGCGGATGT GGCACCACTC CGGGCTCGCC GACACCCTGC TGCGCGCCAG CCAAGCGGGC GGCCGCCACG GTGCGGCCGC AGACCGGGAA CTGGACTCGG TGATGGCGCT TTTCGAGCGG GCCGCCCGCT ACTGCGACCG GCTGCCGCCG GGTTCCCCGG AAGGCTTCCT GGAAGACCTG 
GAAGCCCAGG AGATCCCCGG GGACACGCTC GCCGAACAGG CCCCCCAAGG AGAGACGGTG CGGATTCTCA CCGCCCACCG CTCCAAAGGA CTGGAGTGGG ATCTGGTCGT GGTTGCCGGC GTCCAAGAAG GCACCTGGCC GGACCTGCGG CTGCGCGGTT CCCTGCTCGG CGTGGAACAG CTTCTCGACA CGGTGGGCGG TTTTGCGGAG ACTTCCCCGG CCGCCGTGGT GTCCAAGCAG CTCGACGAAG AACGGCGGCT GTTCTACGTC GCGCTCACCC GGGCCCGGCG GGAACTCGTG GTGACCGCGG TCGGCGGGGA GGACGTCGAA GAACGGCCTT CCCGGTTCCT CACCGAACTC GGCCTCGGCG AACCGGAACG CCTGGCCACC GGATACCGCT GGCTGTCGCT GCCCGCCCTG GTCGCCGACC TGCGCGCCAC GCTCCTCGAC CCGCACACCG ACGAGCCGGT GCGCCGCGCC GCAGCCGCCC ATCTGGCCCG CCTCGCCGAT GCGGGAGTGC GCGGCGCCGA CCCCGCGGAG TGGTACGCCC TCACCGAACT CTCCGACAGC TCACCCCTCG TCCTCGAAGG GGAGCAGATC CGTATCTCCC CTTCCCAAGT GGAGAAGTTC ACCACCTGTG AACTGCGCTG GCTGCTGGAG ACCGCCGCGG GGGCGGAAAA ACGCCGGGCC GCGTCCGGGA TCGGCAGCAT CGTCCACGCG CTGGCCCGCA TCGCCGCGGA GAACCCGGAC CTTCCGGAAC TCCTGCGGCG CATGGACCAG ATCTGGTCCG ACCTTGACTT CGGCGGCCCG TGGTACGCCG AAAAACAGCG GGAGCGCGCC GAGGAGATGC TGCAGCGGTT CCTGGACTGG CAGAAGGAGA ACCCGCGGGA ACTCGTCGCC ACCGAGAAAA AGTTCCGGGT CGAAGTCGGC AACATCGAAA TCTCCGGCCA AGTGGACCGG CTGGAACGCG ACGCGGAAGG ACGCGGTGTC ATCGTCGACA TCAAGACCGG GACCGCGGTA CCGGACCGGG AGATCAGCCG CCACCCCCAG CTCGGCGTCT ACCAGTTGGC GCTGCTGATG GCGGCGTTCG AACACTACGG CCTGGTCGAG CCGGGCGGCG CAGCACTGCT GCAGATCGGC GACCGGAAAA CCGCGAAAGA GCAGACCCAG CCGGCCCTCG CCGACGACGC GGACCCTGAA TGGCCGCAGC GGCTGGTGCA GAAGGTGGCC GCGGGCATGG CGGGGGCCCG GTTCCGCGCG AAAGCCACGC CGTCGTGCCG GCACTGTTCA GTCCGGGCAA GCTGCCCAGT GCAGAGCGAA GGCGACCACG TCTGA
Protein sequence | MDNPPYRLVR RIRHRAPAPE LDDDQRRVVD HEGGPLLVLA GPGTGKTTTI VEAIVDRVEN RGVDPAHVLV LTFSRKAAQE LRERITARLR RTVREPLALT FHSYAYALIR REFQRVGDHA PRLLSGPEQL MEVRELLKGE LADGAPGWPE QLRPMLHTRG FAEELRDFLM RVQERGLHAD DVRELGRRHG REDWVAAGGF LERYTGRFDI APVPTFNYAE LVRVAANLLN DPDIQARERA ARKVVFVDEY QDTDPAQEEL LHALAGDGRD LVVVGDPDQS IYGFRGAEVR NILNFPDRFR TPTGDPAPVV ALHTCRRSGA ELLRVSRALA QRLPAVPGPG GEGVNAHRNL VPAEGTPPGR ARMLLAESPV HEAAVIADLL RRAHLIDGVP WSRMAVLVRS VTRHVPVLRR ALIAADVPVT VDGDDLPLAS EPLVRSMLLL LRCAFHPERL DEEAAHNLLT SAFGEADALG LRRLGRALRQ LELDAGGRRP AAALLAEILH DPRDLVMVDP EVRAPAERIA TLLRLVRDSI AEGANAEEVL WRMWHHSGLA DTLLRASQAG GRHGAAADRE LDSVMALFER AARYCDRLPP GSPEGFLEDL EAQEIPGDTL AEQAPQGETV RILTAHRSKG LEWDLVVVAG VQEGTWPDLR LRGSLLGVEQ LLDTVGGFAE TSPAAVVSKQ LDEERRLFYV ALTRARRELV VTAVGGEDVE ERPSRFLTEL GLGEPERLAT GYRWLSLPAL VADLRATLLD PHTDEPVRRA AAAHLARLAD AGVRGADPAE WYALTELSDS SPLVLEGEQI RISPSQVEKF TTCELRWLLE TAAGAEKRRA ASGIGSIVHA LARIAAENPD LPELLRRMDQ IWSDLDFGGP WYAEKQRERA EEMLQRFLDW QKENPRELVA TEKKFRVEVG NIEISGQVDR LERDAEGRGV IVDIKTGTAV PDREISRHPQ LGVYQLALLM AAFEHYGLVE PGGAALLQIG DRKTAKEQTQ PALADDADPE WPQRLVQKVA AGMAGARFRA KATPSCRHCS VRASCPVQSE GDHV
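The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below strips the whitespace, computes a GC fraction, and translates an excerpt (the first 30 bases of the gene sequence). It rests on two assumptions: the amino-acid assignments of translation table 11 match the standard genetic code, and the input begins at the initiator codon, which is read as methionine even when it is GTG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (T, C, A, G). Translation table 11
# uses the same amino-acid assignments; it differs in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate up to the first stop codon.

    Assumes the sequence starts at the initiator codon, so the first
    residue is reported as M regardless of the codon (e.g. GTG).
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    if protein:
        protein[0] = "M"
    return "".join(protein)


excerpt = "GTGGACAACC CCCCGTACCG TCTTGTACGC"  # first 30 bases of the gene sequence
print(translate(excerpt))  # MDNPPYRLVR
print(round(gc_fraction(excerpt), 2))
```

Running `gc_fraction` on the full gene sequence, once both wrapped lines are joined, should give a value comparable to the GC content listed in the record.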