Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0051 |
Symbol | |
ID | 7083434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 54144 |
End bp | 57566 |
Gene Length | 3423 bp |
Protein Length | 1140 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697099 |
Product | conserved hypothetical cytosolic protein |
Protein accession | YP_002353748 |
Protein GI | 217968514 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.609156 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACC CGCAATCCCT CGGCCTCGAC TTCGTCGCCG ACGACACCCT GTCCGGCTTC CGCCTGCAGC GCCTGGAGGT GCTCAACTGG GGCACCTTCG ACCGCCACGT GTGGACGCTG CAGCTCGACG GCCGCAACGG GCTGCTCACC GGCGACATCG GCTCGGGCAA GTCCACCCTG GTCGACGCGA TCACCACCCT GCTCGTCCCC GCGCAGCGCA TCGCCTACAA CAAGGCCGCC GGCGCCGACA CGCGCGAGCG CAGCCTGCGC TCCTACGTGC TCGGCCACTA CAAGTCCGAG CGCAACGAGG TCACCGGCGC CGCGCGCCCG ATCGGCCTGC GCGAGCCCGG CAGCTACTCG GTGATCCTCG GCGTATTCCA CAACGCCGGC TACGACCAGA CCGTGAGCCT GGCCCAGGTG TTCTGGTTCA AGGAGGCGCA AGGCCAGCCG GCGCGCTTCT ACCTCGGCGC CGAGCACGCG CTGTCGATCA CGGACGATTT CTCCGGCTTC GGCCCCGACA TCACGCAGCT TCGCAAACGC CTGCGCGGCG CCGGGGCCGA GGTGCTCGAC AGCTTCCCGC CCTACGCAGC GTGGTTCCGC CGCCGCTTCG GCATCGACAA CGAGCAGGCG CTCGAGCTCT TCCACCAGAC CGTGTCGATG AAGTCGGTCG GCAACCTCAC CGACTTCGTG CGCAACCACA TGCTCGAGCC CTTCGAGGTC GGCCCGCGCA TCCAGGCCCT GATCGACCAC TTCGACGACC TCGACCGCGC CCACCAGGCG GTGCTCAAGG CGCAGCGCCA GGTCTCCCTG CTGGGCCCGC TGGTCGACGA CTGCCACCGC CACGACGCGC TCGCCGCCGA CATCAGCGCG CTGCGGCGCT GCCGCGAGGC GCTGCAGCCG CACTTTGCCG CGCGCAAGCT GGCCCTGCTC GACCACCGCC TCGAACTCGC CGCCGAGGAG TGGGCGCGCG CCGACGCCCA GGTCATGCGC CTCGACACGC TGCGCGAGGA ACAGGCCGGC CGCATCGACG CGCTCAAGCA GGCGATCAAC GCCAACGGCG GCGACCGCCT CGAGCGCCTC GCCGCCGAGA TCCGCAAGCA GGAGCAGGTG CGCGACGCGC GCCTGGCCAA GGCACGCCGC TACGGCGAGC TCGCCGCGGT GCTCGGCGCG TCCGCGGCCG CCGACGCCGA AGCCTTCGCC AGCCAGCGCC TGCACTTCGC CGCCAGCCGA GAGGAGGCGC GCAACCGCGA CGCCGAACTG CAGAACACGC TCACCGAGCA CGCCGTCACC CTGCGCCAGG GCAAGCTCGA GTACGACGCG CTGAGCGCCG AGATCGACAG CCTCAAACGT CGCCGCAGCA ACATCGACGA CCGCCAGATC CAGATCCGCG CCGCGCTGTG CGGCGCGCTC GGCATCGACG TCGAGGACAT GCCCTTCGCC GGCGAGCTGC TCCAGGTGCG CGAGGACGAG CGCGACTGGG AGGGCGCCGC CGAGCGCCTG CTGCGCGGCT TCGGCCTCGC CCTGCTGGTG CCCGACGCGC ACTACAAGGC GGTCGCCGAG TGGGTGGACG GCAACCACCT GCGCGGCCGC CTGGTGTACT TCCACGTCCG CCCGCCGCGC GCGGGCGAGC TGCCCGCGCT GCACCCCGAC TCGCTGGTGC GCAAGCTCGC GATCAAGCCC GACAGCCCGC ACTACGACTG GCTGGAACGC GAGCTCGCCC ACCGCTTCGA CGTCGCCTGC TGCGCCACGC AGGAGCAGTT CCGCCGCGAG ACGCGCGCGA TCACCCGCGC CGGCCAGATC AAGGACCCCA GCGGCCGCCA CGAGAAGGAC GACCGCCACG CCATCGCCGA CCGCAGCCGC TACGTGCTGG GCTGGAGCAA CACCGCCAAA ATCGAGGCCC TGGAAGCCCA GCGCCGCCAG CTCGAAGCCC GCCTCGGCGA AGTGGGCAGC CAGATCGGCC GCATCGAGGC CGAGCGCCGC ACCCTCGCCG GCCGCCTCGA CGCCCTCACC CGCCTGGAAG AATTCACCGC CTTCGACGAG CTCGACTGGC ACGGTGTCGC CGGCGCGATC GCCACGCTGG AGGACGAACG CCGCGCGCTC GAAGCCGCCT CCGACCTGCT CAAGACACTC AACCAGCAGC TCGCCGACCT GCAGCGGATC CGTGTCGACA CCGAGCGCGA GCTCGGCGCC GCGCGCGAAC GCCGCGCCAA GGTCGAGCAG CGCCAGGCCG ACGCCGAGGC GCTGCGCACC GCCACGCTCG CCCTCGTCGA CGCCGCGCCG ATCGACCCGG AGCTCGTCCC CCGATTGGAA GCCCTGTGCG CCGAGGTGCT CGGCGAGCAC CCGCTCACCG TCGAGTCCTG CGACAACCGC GAGCAGGAGG TGCGCACCGC GCTGCAGTCG CGCATCGACG CCGAGGATCT CCGCCTCAAG CGCCTGGCCG AGAAGATCAT CAAGGCGATG GCGGCCTTCA AGCAGCAGTT TCCCCTCGAG ACCGCCGAGA TCGACGCCAG CCTGGAGGCC GGCTTCGAGT ACGAGAAGCT GCTCGCGCAG CTCGACCGCG ACGACCTGCC GCGCTTCCTC GCCCGCTTCA AGGAGCTGCT CAACGTCAAC ACCATCAACG AGATCGCCAA CTTCAACGCC CAGCTCGCGC GCGAGCGCGA GACCATCAAG GAGCGCATCG CCCACATCAA CAAGTCGCTC GGCGAGATCG ACTACAACCC CGGGCGCTAC ATCGTGCTTG AATCGCAGGC GAGCCCCGAC GCCGAGATCC GCGACTTCCA GCAGGAGCTG CGCGCCTGCA CCGAGGGCGC GCTGACCGGG GCGGGCGAGG GCGACGACGA GCAGTATTCC GAGGCGCGCT TCCTGCGCGT CAAGGGCATC ATCGACCGCT TCCGCGGCCG CGAGGGCCTC TCCGACCAGG ATCGCCGCTG GACCGCCAAG GTCACCGACG TGCGCAACTG GTTCCTCTTC GCCGCCAGCG AACGCTGGCG CGAGGACGAC AGCGAGCACG AGCACTACTC GGACTCCGGC GGCAAGTCGG GCGGGCAGAA GGAGAAGCTC GCCTACACCA TCCTCGCCGC CAGCCTCGCC TACCAGTTCG GCCTGGAGTG GGGCGCGGTG CGCTCGCGCT CGTTCCGCTT CGTCGTCATC GACGAGGCCT TCGGCCGCGG CTCGGACGAA TCCGCGCAGT ACGGCCTGCG CCTGTTCGAG CAACTCAACC TGCAACTGCT GATCGTCACC CCGCTGCAGA AGATCCACAT CATCGAGCCC TTCGTCGCCA GCGTCGGCTT CGTGCACAAC GAGGGCGGCA GCGCCTCGAA GCTGAGGAAC CTGTCGATCG AGGAGTACCG CGCGCAGAAA GCCGAGATGC GGGCGGCGGC GCAGGCCGCG CCCCGCGCCG GCGGCGGCGC ATCCGCATCA TGA
|
Protein sequence | MNDPQSLGLD FVADDTLSGF RLQRLEVLNW GTFDRHVWTL QLDGRNGLLT GDIGSGKSTL VDAITTLLVP AQRIAYNKAA GADTRERSLR SYVLGHYKSE RNEVTGAARP IGLREPGSYS VILGVFHNAG YDQTVSLAQV FWFKEAQGQP ARFYLGAEHA LSITDDFSGF GPDITQLRKR LRGAGAEVLD SFPPYAAWFR RRFGIDNEQA LELFHQTVSM KSVGNLTDFV RNHMLEPFEV GPRIQALIDH FDDLDRAHQA VLKAQRQVSL LGPLVDDCHR HDALAADISA LRRCREALQP HFAARKLALL DHRLELAAEE WARADAQVMR LDTLREEQAG RIDALKQAIN ANGGDRLERL AAEIRKQEQV RDARLAKARR YGELAAVLGA SAAADAEAFA SQRLHFAASR EEARNRDAEL QNTLTEHAVT LRQGKLEYDA LSAEIDSLKR RRSNIDDRQI QIRAALCGAL GIDVEDMPFA GELLQVREDE RDWEGAAERL LRGFGLALLV PDAHYKAVAE WVDGNHLRGR LVYFHVRPPR AGELPALHPD SLVRKLAIKP DSPHYDWLER ELAHRFDVAC CATQEQFRRE TRAITRAGQI KDPSGRHEKD DRHAIADRSR YVLGWSNTAK IEALEAQRRQ LEARLGEVGS QIGRIEAERR TLAGRLDALT RLEEFTAFDE LDWHGVAGAI ATLEDERRAL EAASDLLKTL NQQLADLQRI RVDTERELGA ARERRAKVEQ RQADAEALRT ATLALVDAAP IDPELVPRLE ALCAEVLGEH PLTVESCDNR EQEVRTALQS RIDAEDLRLK RLAEKIIKAM AAFKQQFPLE TAEIDASLEA GFEYEKLLAQ LDRDDLPRFL ARFKELLNVN TINEIANFNA QLARERETIK ERIAHINKSL GEIDYNPGRY IVLESQASPD AEIRDFQQEL RACTEGALTG AGEGDDEQYS EARFLRVKGI IDRFRGREGL SDQDRRWTAK VTDVRNWFLF AASERWREDD SEHEHYSDSG GKSGGQKEKL AYTILAASLA YQFGLEWGAV RSRSFRFVVI DEAFGRGSDE SAQYGLRLFE QLNLQLLIVT PLQKIHIIEP FVASVGFVHN EGGSASKLRN LSIEEYRAQK AEMRAAAQAA PRAGGGASAS
|
| |