Gene Tmz1t_0051 details

Gene Information

Locus tag: Tmz1t_0051
Symbol:
ID: 7083434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 54144
End bp: 57566
Gene Length: 3423 bp
Protein Length: 1140 aa
Translation table: 11
GC content: 71%
IMG OID: 643697099
Product: conserved hypothetical cytosolic protein
Protein accession: YP_002353748
Protein GI: 217968514
COG category: [S] Function unknown
COG ID: [COG4913] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
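
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def cds_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table.
start_bp, end_bp = 54144, 57566
gene_len = cds_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 3423 1140
```

Both results agree with the Gene Length and Protein Length fields of the record.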


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.609156
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACC CGCAATCCCT CGGCCTCGAC TTCGTCGCCG ACGACACCCT GTCCGGCTTC 
CGCCTGCAGC GCCTGGAGGT GCTCAACTGG GGCACCTTCG ACCGCCACGT GTGGACGCTG
CAGCTCGACG GCCGCAACGG GCTGCTCACC GGCGACATCG GCTCGGGCAA GTCCACCCTG
GTCGACGCGA TCACCACCCT GCTCGTCCCC GCGCAGCGCA TCGCCTACAA CAAGGCCGCC
GGCGCCGACA CGCGCGAGCG CAGCCTGCGC TCCTACGTGC TCGGCCACTA CAAGTCCGAG
CGCAACGAGG TCACCGGCGC CGCGCGCCCG ATCGGCCTGC GCGAGCCCGG CAGCTACTCG
GTGATCCTCG GCGTATTCCA CAACGCCGGC TACGACCAGA CCGTGAGCCT GGCCCAGGTG
TTCTGGTTCA AGGAGGCGCA AGGCCAGCCG GCGCGCTTCT ACCTCGGCGC CGAGCACGCG
CTGTCGATCA CGGACGATTT CTCCGGCTTC GGCCCCGACA TCACGCAGCT TCGCAAACGC
CTGCGCGGCG CCGGGGCCGA GGTGCTCGAC AGCTTCCCGC CCTACGCAGC GTGGTTCCGC
CGCCGCTTCG GCATCGACAA CGAGCAGGCG CTCGAGCTCT TCCACCAGAC CGTGTCGATG
AAGTCGGTCG GCAACCTCAC CGACTTCGTG CGCAACCACA TGCTCGAGCC CTTCGAGGTC
GGCCCGCGCA TCCAGGCCCT GATCGACCAC TTCGACGACC TCGACCGCGC CCACCAGGCG
GTGCTCAAGG CGCAGCGCCA GGTCTCCCTG CTGGGCCCGC TGGTCGACGA CTGCCACCGC
CACGACGCGC TCGCCGCCGA CATCAGCGCG CTGCGGCGCT GCCGCGAGGC GCTGCAGCCG
CACTTTGCCG CGCGCAAGCT GGCCCTGCTC GACCACCGCC TCGAACTCGC CGCCGAGGAG
TGGGCGCGCG CCGACGCCCA GGTCATGCGC CTCGACACGC TGCGCGAGGA ACAGGCCGGC
CGCATCGACG CGCTCAAGCA GGCGATCAAC GCCAACGGCG GCGACCGCCT CGAGCGCCTC
GCCGCCGAGA TCCGCAAGCA GGAGCAGGTG CGCGACGCGC GCCTGGCCAA GGCACGCCGC
TACGGCGAGC TCGCCGCGGT GCTCGGCGCG TCCGCGGCCG CCGACGCCGA AGCCTTCGCC
AGCCAGCGCC TGCACTTCGC CGCCAGCCGA GAGGAGGCGC GCAACCGCGA CGCCGAACTG
CAGAACACGC TCACCGAGCA CGCCGTCACC CTGCGCCAGG GCAAGCTCGA GTACGACGCG
CTGAGCGCCG AGATCGACAG CCTCAAACGT CGCCGCAGCA ACATCGACGA CCGCCAGATC
CAGATCCGCG CCGCGCTGTG CGGCGCGCTC GGCATCGACG TCGAGGACAT GCCCTTCGCC
GGCGAGCTGC TCCAGGTGCG CGAGGACGAG CGCGACTGGG AGGGCGCCGC CGAGCGCCTG
CTGCGCGGCT TCGGCCTCGC CCTGCTGGTG CCCGACGCGC ACTACAAGGC GGTCGCCGAG
TGGGTGGACG GCAACCACCT GCGCGGCCGC CTGGTGTACT TCCACGTCCG CCCGCCGCGC
GCGGGCGAGC TGCCCGCGCT GCACCCCGAC TCGCTGGTGC GCAAGCTCGC GATCAAGCCC
GACAGCCCGC ACTACGACTG GCTGGAACGC GAGCTCGCCC ACCGCTTCGA CGTCGCCTGC
TGCGCCACGC AGGAGCAGTT CCGCCGCGAG ACGCGCGCGA TCACCCGCGC CGGCCAGATC
AAGGACCCCA GCGGCCGCCA CGAGAAGGAC GACCGCCACG CCATCGCCGA CCGCAGCCGC
TACGTGCTGG GCTGGAGCAA CACCGCCAAA ATCGAGGCCC TGGAAGCCCA GCGCCGCCAG
CTCGAAGCCC GCCTCGGCGA AGTGGGCAGC CAGATCGGCC GCATCGAGGC CGAGCGCCGC
ACCCTCGCCG GCCGCCTCGA CGCCCTCACC CGCCTGGAAG AATTCACCGC CTTCGACGAG
CTCGACTGGC ACGGTGTCGC CGGCGCGATC GCCACGCTGG AGGACGAACG CCGCGCGCTC
GAAGCCGCCT CCGACCTGCT CAAGACACTC AACCAGCAGC TCGCCGACCT GCAGCGGATC
CGTGTCGACA CCGAGCGCGA GCTCGGCGCC GCGCGCGAAC GCCGCGCCAA GGTCGAGCAG
CGCCAGGCCG ACGCCGAGGC GCTGCGCACC GCCACGCTCG CCCTCGTCGA CGCCGCGCCG
ATCGACCCGG AGCTCGTCCC CCGATTGGAA GCCCTGTGCG CCGAGGTGCT CGGCGAGCAC
CCGCTCACCG TCGAGTCCTG CGACAACCGC GAGCAGGAGG TGCGCACCGC GCTGCAGTCG
CGCATCGACG CCGAGGATCT CCGCCTCAAG CGCCTGGCCG AGAAGATCAT CAAGGCGATG
GCGGCCTTCA AGCAGCAGTT TCCCCTCGAG ACCGCCGAGA TCGACGCCAG CCTGGAGGCC
GGCTTCGAGT ACGAGAAGCT GCTCGCGCAG CTCGACCGCG ACGACCTGCC GCGCTTCCTC
GCCCGCTTCA AGGAGCTGCT CAACGTCAAC ACCATCAACG AGATCGCCAA CTTCAACGCC
CAGCTCGCGC GCGAGCGCGA GACCATCAAG GAGCGCATCG CCCACATCAA CAAGTCGCTC
GGCGAGATCG ACTACAACCC CGGGCGCTAC ATCGTGCTTG AATCGCAGGC GAGCCCCGAC
GCCGAGATCC GCGACTTCCA GCAGGAGCTG CGCGCCTGCA CCGAGGGCGC GCTGACCGGG
GCGGGCGAGG GCGACGACGA GCAGTATTCC GAGGCGCGCT TCCTGCGCGT CAAGGGCATC
ATCGACCGCT TCCGCGGCCG CGAGGGCCTC TCCGACCAGG ATCGCCGCTG GACCGCCAAG
GTCACCGACG TGCGCAACTG GTTCCTCTTC GCCGCCAGCG AACGCTGGCG CGAGGACGAC
AGCGAGCACG AGCACTACTC GGACTCCGGC GGCAAGTCGG GCGGGCAGAA GGAGAAGCTC
GCCTACACCA TCCTCGCCGC CAGCCTCGCC TACCAGTTCG GCCTGGAGTG GGGCGCGGTG
CGCTCGCGCT CGTTCCGCTT CGTCGTCATC GACGAGGCCT TCGGCCGCGG CTCGGACGAA
TCCGCGCAGT ACGGCCTGCG CCTGTTCGAG CAACTCAACC TGCAACTGCT GATCGTCACC
CCGCTGCAGA AGATCCACAT CATCGAGCCC TTCGTCGCCA GCGTCGGCTT CGTGCACAAC
GAGGGCGGCA GCGCCTCGAA GCTGAGGAAC CTGTCGATCG AGGAGTACCG CGCGCAGAAA
GCCGAGATGC GGGCGGCGGC GCAGGCCGCG CCCCGCGCCG GCGGCGGCGC ATCCGCATCA
TGA
 
Protein sequence
MNDPQSLGLD FVADDTLSGF RLQRLEVLNW GTFDRHVWTL QLDGRNGLLT GDIGSGKSTL 
VDAITTLLVP AQRIAYNKAA GADTRERSLR SYVLGHYKSE RNEVTGAARP IGLREPGSYS
VILGVFHNAG YDQTVSLAQV FWFKEAQGQP ARFYLGAEHA LSITDDFSGF GPDITQLRKR
LRGAGAEVLD SFPPYAAWFR RRFGIDNEQA LELFHQTVSM KSVGNLTDFV RNHMLEPFEV
GPRIQALIDH FDDLDRAHQA VLKAQRQVSL LGPLVDDCHR HDALAADISA LRRCREALQP
HFAARKLALL DHRLELAAEE WARADAQVMR LDTLREEQAG RIDALKQAIN ANGGDRLERL
AAEIRKQEQV RDARLAKARR YGELAAVLGA SAAADAEAFA SQRLHFAASR EEARNRDAEL
QNTLTEHAVT LRQGKLEYDA LSAEIDSLKR RRSNIDDRQI QIRAALCGAL GIDVEDMPFA
GELLQVREDE RDWEGAAERL LRGFGLALLV PDAHYKAVAE WVDGNHLRGR LVYFHVRPPR
AGELPALHPD SLVRKLAIKP DSPHYDWLER ELAHRFDVAC CATQEQFRRE TRAITRAGQI
KDPSGRHEKD DRHAIADRSR YVLGWSNTAK IEALEAQRRQ LEARLGEVGS QIGRIEAERR
TLAGRLDALT RLEEFTAFDE LDWHGVAGAI ATLEDERRAL EAASDLLKTL NQQLADLQRI
RVDTERELGA ARERRAKVEQ RQADAEALRT ATLALVDAAP IDPELVPRLE ALCAEVLGEH
PLTVESCDNR EQEVRTALQS RIDAEDLRLK RLAEKIIKAM AAFKQQFPLE TAEIDASLEA
GFEYEKLLAQ LDRDDLPRFL ARFKELLNVN TINEIANFNA QLARERETIK ERIAHINKSL
GEIDYNPGRY IVLESQASPD AEIRDFQQEL RACTEGALTG AGEGDDEQYS EARFLRVKGI
IDRFRGREGL SDQDRRWTAK VTDVRNWFLF AASERWREDD SEHEHYSDSG GKSGGQKEKL
AYTILAASLA YQFGLEWGAV RSRSFRFVVI DEAFGRGSDE SAQYGLRLFE QLNLQLLIVT
PLQKIHIIEP FVASVGFVHN EGGSASKLRN LSIEEYRAQK AEMRAAAQAA PRAGGGASAS
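
As a rough illustration of how the gene and protein sequences relate, the sketch below strips the 10-character block spacing, computes GC content, and translates codon by codon. It is a minimal sketch and not the database's own pipeline. The codon table is built from the NCBI amino-acid string for translation table 11, and the helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The alternative start codons that table 11 allows are not handled, which does not matter here because this gene starts with ATG.

```python
# Codon table for NCBI translation table 11, built from the standard
# TCAG-ordered amino-acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above (60 nt).
first_row = "ATGAACGACC CGCAATCCCT CGGCCTCGAC TTCGTCGCCG ACGACACCCT GTCCGGCTTC"
print(translate(clean_sequence(first_row)))  # MNDPQSLGLDFVADDTLSGF
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence through the same functions would allow the GC content and Protein Length fields to be checked in the same way.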