Gene Information |
Locus tag | Tmz1t_2402 |
Symbol | |
ID | 7094324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011667 |
Strand | - |
Start bp | 63812 |
End bp | 65776 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643701088 |
Product | protein of unknown function DUF524 |
Protein accession | YP_002364229 |
Protein GI | 217980179 |
COG category | [S] Function unknown |
COG ID | [COG1700] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
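The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminating stop codon. The function names gene_length_bp and protein_length_aa are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption, see above).
    return gene_len_bp // 3 - 1


length = gene_length_bp(63812, 65776)
print(length)                     # 1965
print(protein_length_aa(length))  # 654
```

Both results match the Gene Length (1965 bp) and Protein Length (654 aa) fields of this record.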
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 0.63975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 95 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTGCG TTCCGCTGCA TCGCGCGAGA TTCGACCTAG GCGCGGGCTG CTGGATCGAC ATCGAAGCCG ACCTTGGCCA AGGCAGGCCG CTCGCCGAGG GCGTTATCCA GTCGGACTCG CTGATCTCCG CTGAGGAAGA GACCAAGGCT GAGGAGATCT CTTGGGGCGG TATTCCGCCG ATGTGCTGGC GTGACGCTGG AGAACTGTTC GAGAAGTTCC AGCTTCGTGA GGACACCGAC TACTTTGTCG ACGTGGCTAC GCCCTCGTCG ATACAAGAGG CCATCCATCT GTCTCAAGAA AATCCTACGT GGCCGTTCGA GCCTAGGCTC GCGAACGCCT TCACGCGGGA GCCAGCCAGG CGCTGGCGGC AAGAGGGTGG CAAAACGGTG GTCACAGGGC AGATTCGCTT GCGCTCGCAC GCCGGCGTAT TGAACCTCTC GCCCGTGTTC GGAGGAGAGG TGCGAGCCGA GGTCGCCTGT CGAAAGCTGC GTTACTTTGA GGAGTTCAAG CGTCTGCTGG ACGAACTGGC AGAGAAGGCC ACAGAGCTAC TGCTTTCCTA CGACAGCCCG GTTTCTCTGA ACTTTCAGAC GACTGACGAT CTCGCAAAAA ACGAATCCGC CCTGCACTTC CTGATGCGGC ATGTGATGGC AAAGGAAAGG CTCCCGATGT CCATCGAGGA GATCGTAGAG CGACCGCATG TGCGCTTGGT GGAACGCGTG GAGCCCACGC CGATCGACGA AATCCAAGAA GCCGATCCAG AACTAGTCGC TGACGGGCTG GACTACTCTG AACTCAGCCC AAGTGGTCCC TTAGCTCGCC TATTTAGAGG TTTCACGCCT ACAGCGCTGC CTCATCGCGA GAGTTACGAA TCCCTCGATA CGCCTGAGAA CCGCTATGCC AAAGCTTTCT TGGAGCATTG CAGCCTTGTC TCACGGCGGC TTGAAGGCGC GTTGGCCTCA CAGGGACGGC GAGCGTCAGC GCGCGAAGCT CGCGCTTGGG GCGTGTCGCT CGACGAAGCT TTGCAGCATG GAATGTGGCG AGATGTGGGC CCTCTCACTC AAATCCCTGC AAACTCGCAG ACGCTCTTGC GAAAGCGCGG TTACAAGGAT CTGCTGCGCT ACGACCTTTC GTTGCGCATG GCGCTGGAAC TCGCGTGGAA GGAAGGTGCA CAACTCTCCG ACGGACTCTC CGGCGACATC CGCCCAGTAA ACCAGATCTA CGAGTACTGG TGCTTTTTCT GCCTTCGGGA GATCCTTCTT TCGCTGTGCG TCGAAATTGG AGGCGGTAAC TTCCTGACCG TGAGCAAGGA CGGCCTGAAG GTGCAACTTG CCAAGGGGGC TCGAAGCGAG TGCCGCTTCG AGTTCACAGG GGACAGCGGC GCCAAAGTTC GCGTCTCACT CTTCTTCAAC CGCCGCTTTC GCCGCCCGAA ATCGCCGCAG TCGGCGTGGG AGGGCAGCTA CACCGCATCT TTCGATCCCG ATTTCAGCAT CCGGCTGAGC AAGGCCGCCG CAGACTTGCC ATCGCATTGG CTTCACTTCG ACGCTAAGTA TCGGCTCGAG AGGCAACAGT CGGAGACCTT GTTCGAAGAA GCGCCCGACG GCGAGCAGGA TGGTGGAATA GCTGATTACG AAGCCGAAGT GGCACGAGTG CACAAGCTTG AGGATCTCTT CAAGATGCAC ACGTATCGGG ACGGAATCCT TGGTACACGG GGAGCCTACG TACTCTTTCC TGGTGACGGC GTTGGAGGCA TCGTCAGCGC GCCAAAGCCC AATCTCTTTG TTCGGAACCC GGCCGCGTTT 
GGCGGTACGG GGTCCCATCA AATCCCGAGT GTCGGCACCT TCGACTTGGC CCCAGGTGGT GGCGCTGAGC AAAAGCAGGC CATCGCTTCG CTGCTGACTA GCGTACTCGA AGCAGTCGCG GGAGCGCCTA CCTATCAAGA GGAATATGGT TACTGGACCC CGTAA
|
Protein sequence | MSCVPLHRAR FDLGAGCWID IEADLGQGRP LAEGVIQSDS LISAEEETKA EEISWGGIPP MCWRDAGELF EKFQLREDTD YFVDVATPSS IQEAIHLSQE NPTWPFEPRL ANAFTREPAR RWRQEGGKTV VTGQIRLRSH AGVLNLSPVF GGEVRAEVAC RKLRYFEEFK RLLDELAEKA TELLLSYDSP VSLNFQTTDD LAKNESALHF LMRHVMAKER LPMSIEEIVE RPHVRLVERV EPTPIDEIQE ADPELVADGL DYSELSPSGP LARLFRGFTP TALPHRESYE SLDTPENRYA KAFLEHCSLV SRRLEGALAS QGRRASAREA RAWGVSLDEA LQHGMWRDVG PLTQIPANSQ TLLRKRGYKD LLRYDLSLRM ALELAWKEGA QLSDGLSGDI RPVNQIYEYW CFFCLREILL SLCVEIGGGN FLTVSKDGLK VQLAKGARSE CRFEFTGDSG AKVRVSLFFN RRFRRPKSPQ SAWEGSYTAS FDPDFSIRLS KAAADLPSHW LHFDAKYRLE RQQSETLFEE APDGEQDGGI ADYEAEVARV HKLEDLFKMH TYRDGILGTR GAYVLFPGDG VGGIVSAPKP NLFVRNPAAF GGTGSHQIPS VGTFDLAPGG GAEQKQAIAS LLTSVLEAVA GAPTYQEEYG YWTP
|
| |
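The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of joining the blocks and computing a G+C percentage is shown below. The record does not state how its GC content value is calculated, so the plain (G + C) / length formula here is an assumption, and the names clean_sequence and gc_percent are hypothetical.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases (assumed formula: (G + C) / length)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a short example.
fragment = clean_sequence("ATGAGCTGCG TTCCGCTGCA")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 60.0
```

The value printed here describes only the 20-base fragment. Applying the same functions to the full gene sequence would give a figure to compare with the GC content field.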