Gene Tmz1t_1862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1862 
Symbol 
ID7084285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2095527 
End bp2098787 
Gene Length3261 bp 
Protein Length1086 aa 
Translation table11 
GC content73% 
IMG OID643698885 
Producthypothetical protein 
Protein accessionYP_002355510 
Protein GI217970276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0997362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGCCT ACGCACGTCC CCCCGCACGC GTGAGCCAGC CCGGCGAGAA CATCGAGCGC 
GAGGCCGAGG CGCACGCCGC CGCCGTCGCC CCGACGAACT CGCCCCATCC GACGCCGGAC
TCGGCCCCCG CGCCAGTACC GACCCGCAGC ACCGAAGACA GCGCGGGCGC GTCCTCGGCT
GCGAACGTCG GCTCCACCCC CCCACCCGCA TCCGCCTCCG CCTCCGCCGA CGCGAACTCG
CCCTCCGCGC GCCTTCCCGC CTATGCCGCC GGCCGTGTCG AAGCGCGCCG CGGGCGCGGC
GATCCGCTGC CGGCCGATGC CCGCGCCCCG CTCGAGGACC ACTTCGGTCG CGACTTCGCC
GACGTGCGCC TCCATTCGGA CGCCGAAGCG GCGCGCCTCA CCGCTGCGCT CGGCGCACGC
GCGTTCACGT CGGGGCGCGA CATCTACCTC GCACCGGGCA CCGTCGCCTC CGACACCGAG
GAGGGGCGTC ACCTCCTCGC GCACGAACTC GCCCACGTCG TGCAGCAGGA TGGCCAGGCC
GCGGCGACAC TCGCGCGCGC GATCGAACCG CCCCCTCCGG CCCGCTACCA CGAGACCGAG
CGCAGCGCTG CCGCCCTCGC CGCACTGGAT CGCCTCGAGA TCCCCGCGGT GAAGGCGCGC
CACCTGCCGC TCTACAGCGG CCTCGCGCAG GCGCACCAGC TCAAGCGCGT GCGTGGCTAC
GGACGCGAGG GCGCCGACCA GCGCTCGGTA TGGAGCCGCC AGGTCGAGGT CGGCGCCGAC
GCCGTGCGCG AACGCCTGAC CGAACGCGGC ATCCCGGTTC CCGCCGACCC CAGCGGCCGC
GTACGCCTGA AACTGCATCG CGGCGCACGT ACGACGTCGA AACGCCTGTC CGAGCTGCAG
ACCCTGCTGC GCATCCCCGA ATGGGACCGC GAGGGCCGCA GGCGCGATTT CCAGGTCGAC
CACATCGTCG AACTGCAGGT CTCCGGCCAG ATGGGCTCGG GGGTGGGCAA CAGCGTCGAG
AACATGGAGC TCCTCGACCA GCCCTCGAAC TCGAGCTCGG GCGGCACGAT CCGCAGCAGC
ATCTATCGCA AGCTCGACGG CTTCCTCGAT ACCCTGGAGC CTCGACCCGA CCGAGCAAGC
TTCCTGCGCG GACACGACCT CGTCTTCACC AGCGTCGCAG CAACCGGCGC GGGCTCGGCG
GCGACGGGCT CGGCGTGGTG GACCAGGGCC GAGATCGAGC AGGCCCACAC CCTCGGCACC
GCGAGCGCCC CCACCGCCGC CGAAGACGGC GACCGCGCCG GCAGCGCCGA GGAGTTCATC
CTCAGCGCCG CCCCCGGCGG CATCGCCGTG GGCCGCTTCC GCCACAGCCC CGGTGCCGCG
CCCACGATCG GCGCCACCCA GGCGCGCGCG CTCGCCGGCC TGCGCATCAC GGCGATCGCG
CTCACCGACC TTACCGGCAC GCCCGACGGC CCCGTCGGTA CGCTCTCGGC AGAATGGGAC
CTGCCCGCCG ACTGGCAACC CGCCAATCCT GCGATCACGA TCTCGCTCCA GGGCGATGGC
GAGTACCGCG GCTACCCGTC GGCGCTTCCC GGCCTCGATC TCGAGTACCG CCATCTCAGC
CCGGTGAGTT TCACCCGCAT CTCCACCGAA GACGGCGAGC TGTACGCCGA GGGCACGCTC
ACGCCCTCGA TCCCCATCCT CGCAGCTCCG CTGACGGTGC AACTGCGCGG ACGCGAGCTC
GGCTTCGCGC TCGACTACGG TCCGGAACAG GTCAGCCTGC CGATCCCGCG CACGACGATC
GACGACGCCT GGGTCAGCGT CTTCTATTCG ACCACCCGCG GACTGGGCGT GGGAGGCGAC
ATCCTGTATT CGATCGAGGG GGCAGGCAGC GGCGAGCTCG GCGCCTCGGT CTCGACCGGC
GGGGGCGTCG CCTTCGAGGG CGGCTTCACT TTCGATCCCG CCCTCTTCGA CCGCGCTCGC
GTCCGCGCCT GGTGGCGTGA TGGTCGCCTG GGCGCCGAAG GCACGATCGG CATCGATACC
CCGGACAAGA TCCGTGGCAT CCGCAGCGCG ACCGCCTCGG TCCGTGTCGA TGAAGGGCGC
TGGTCGTTCA ACGGCAGTGC CGAACTCTCC GTGCCCGGTC TCAGCCAGGC CTCGATCGCG
ATCCGCCAGG GCGAGGGCGG ACTCGAGCTC GCGGGCGACG TCGCCCTCGC CACCAACCCC
GCCATCCGCT CCGGCACCCT GCACGTCGAA TGCGCGCAGA CCGACGGCGA GTGGAAGGTC
GCGGCGAGCG GCACCGCTCA GCCTGCGATC CCCGGCGTCG ACGCGGAGCT CGCCGTCACC
TATGCGGACG GCGCCTTCGA CGCGCGCTTC TCCGGCGCCT TCCGCCGCGG CATGCTCTCC
GGCCAGCTCA GCGTCGGCGC CACCAACCGC GCCGTCGCTG CGGACGGGAG CCCCGGCGGC
CCGCCCAGCG CTCCGGATGC GCCCATCGTG GTCTACGGCA GCGGCTCCGC GACCGTGCGG
ATCGCCCCCT GGCTGCAGGG CAGCGCGGGC CTGCGGGTCG CTCCCGACGG CGAGCTCACC
GTGTCCGGCG AGATCGCCCT GCCCGATTCG CTGGAGATCT TCTCCCGGCT GGAGTACGAC
AAGCGCCTGT TCGGCATGTC GACGCAGATC CCCATCGTCC CCGGCGTGGT CGCCGAGGTC
GGCGGCAATC TCAGTGCCAA CGCCAGCGTC GGCCCCGGGG CACTGGACCG GCTCGCTGTC
CGCATCGAAT ACAACCCCGC ACACGAGGAT GACACCCACG TCACCGGCGA GGCCCACCTC
GAGGTTCCGG CGCAGGCAGG CCTGCGGCTC GGCGCGCGTG CCGGCGTCGG CCTCGGCATC
ACCGGCGCAA GTGCCACCGG GGGCCTCGAG ATCGGCGGCG CCCTCGGCAT CGCAGGGGCT
GCCGAGGCTG GCGTGCGCAT CGACTGGATG CCCTCGCGCG GCCTCGAGAT CGACGCCGAG
GCCGCGCTCC ACGCCCAGCC GCGCTTCCGC TTCGACGTTT CCGGCTACGT CGCCGTCACC
GTGCTCGGCG CCAGCCTCTA CGACGAGCGC ATCGAGCTCG CCGCCTATGA GCTCGGCTCT
GGCCTCGAGT TCGGCGTCCG CTTCCCCGTC ACCTACCGTG AGGGAGAACC CTTCGACCTC
TCCCTCGACG ACCTCGAGTT CCAGGTCCCC GAGGTGGATC CGGCGGCGAT GATCAGGCAG
CTGGGGGAGA CGATCTTCTG A
 
Protein sequence
MPAYARPPAR VSQPGENIER EAEAHAAAVA PTNSPHPTPD SAPAPVPTRS TEDSAGASSA 
ANVGSTPPPA SASASADANS PSARLPAYAA GRVEARRGRG DPLPADARAP LEDHFGRDFA
DVRLHSDAEA ARLTAALGAR AFTSGRDIYL APGTVASDTE EGRHLLAHEL AHVVQQDGQA
AATLARAIEP PPPARYHETE RSAAALAALD RLEIPAVKAR HLPLYSGLAQ AHQLKRVRGY
GREGADQRSV WSRQVEVGAD AVRERLTERG IPVPADPSGR VRLKLHRGAR TTSKRLSELQ
TLLRIPEWDR EGRRRDFQVD HIVELQVSGQ MGSGVGNSVE NMELLDQPSN SSSGGTIRSS
IYRKLDGFLD TLEPRPDRAS FLRGHDLVFT SVAATGAGSA ATGSAWWTRA EIEQAHTLGT
ASAPTAAEDG DRAGSAEEFI LSAAPGGIAV GRFRHSPGAA PTIGATQARA LAGLRITAIA
LTDLTGTPDG PVGTLSAEWD LPADWQPANP AITISLQGDG EYRGYPSALP GLDLEYRHLS
PVSFTRISTE DGELYAEGTL TPSIPILAAP LTVQLRGREL GFALDYGPEQ VSLPIPRTTI
DDAWVSVFYS TTRGLGVGGD ILYSIEGAGS GELGASVSTG GGVAFEGGFT FDPALFDRAR
VRAWWRDGRL GAEGTIGIDT PDKIRGIRSA TASVRVDEGR WSFNGSAELS VPGLSQASIA
IRQGEGGLEL AGDVALATNP AIRSGTLHVE CAQTDGEWKV AASGTAQPAI PGVDAELAVT
YADGAFDARF SGAFRRGMLS GQLSVGATNR AVAADGSPGG PPSAPDAPIV VYGSGSATVR
IAPWLQGSAG LRVAPDGELT VSGEIALPDS LEIFSRLEYD KRLFGMSTQI PIVPGVVAEV
GGNLSANASV GPGALDRLAV RIEYNPAHED DTHVTGEAHL EVPAQAGLRL GARAGVGLGI
TGASATGGLE IGGALGIAGA AEAGVRIDWM PSRGLEIDAE AALHAQPRFR FDVSGYVAVT
VLGASLYDER IELAAYELGS GLEFGVRFPV TYREGEPFDL SLDDLEFQVP EVDPAAMIRQ
LGETIF