Gene Tmz1t_0997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0997 
Symbol 
ID7083731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1097750 
End bp1099408 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content71% 
IMG OID643698019 
Producthypothetical protein 
Protein accessionYP_002354659 
Protein GI217969425 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGAGC TGACGCCGCA GGACATGGCT GCCAAGCTTC TGGCCACCGG CTTCGAGCGC 
AGCGGCCCTT CGGCCGCGAC CTTGAGCGAC CCCATCGCCG ACACGCCGAT GGTGGTGACG
CTGGACCAGT TGCGGCCCTA CGACCACGAC CCGCGCGTGA CGCGCAACCC GGCCTATGCG
GAGATCAAGG CGTCCATCCG CGAACGCGGG CTGGACGCGC CCCCCGCGAT CACGCGCAGG
CCGGGCGAGG CGCATTACAT CATTCGCAAC GGCGGCAACA CGCGGCTCGC GATCCTGCGC
GAGTTGTGGA GCGAGACCAA GGAGGAACGC TTCTTCCGCA TTGCGTGCCT GTTCCGCCCG
TGGCCGGCGC GCGGCGAAAT CGTGGCGCTG ACTGGACATC TGGCCGAGAA CGAGCTGCGC
GGCGGGCTGA GCTTCATCGA GCGGGCCTTG GGCATCGAGA AGGCGCGCGA GTTCTACGAG
CAGGAAAGCG GCCAGGCGCT GTCACAGAGC GAACTCGCGC GGCGGCTGAC GGCCGACGGC
TATCCGGTGC CGCAGTCACA CATCAGCCGC ATGAACGATG CGGTGCGCTA TCTGCTGCCG
GCGATCCCGA CGCTGCTGTA CGGCGGATTG GGCCGGCATC AGGTGGACCG GCTCGCGGTG
CTGCGCAAGG CGTGCGAGCG CACCTGGGAG CGGCGTGCGC TGGGCCGCAC CGTGGCCGTG
GACTTCGCCA CCTTGTTTCA GGACGTGCTG ACGCAGTTCG ACACACAGCC GGACGACTTC
TCGCCGCAGC GGGTGCAGGA CGAGCTGGTC GGCCAGATGG CCGAGCTGCT GGAGGCGGAC
TACGACACGC TGGCGCTGGA GATCAACGAC AGCGAAAGCC GCCAGCGTGC GCTGACCAGC
GAACCGGCGG CGCCGACGCC ACCGGCAGCG CCTGTCGTGC CTGCTGATCC TCCCCCGCCG
GTCTCCGCGC CTCAGCAGCC ACCCGCCTCG TCTGTGCCGC GCGACACCAC GCCGGTCGCG
CCTTCGGCGC CAGCAGCGAC ACCGCCTGCA TCGCCCGAAG CGCCGGAGGA CCAGCACGGG
GAACGCGAAG AGCGCCTGCA AGGGCACATC GTGACACCGG CACCGACCAC CGAGCGCCTG
CAGTCCATCC AGCGGATGGT CGCGGACCAG CTCGGCGACA AGCTGCCCGA CTTCGAGGCC
GATGCGCTGC GTGCGATCCC CGTGCAGGTC GGCGGGCTCT ATCCCATCTC GGACGTCTGG
TACGTCGAGC CGGGGCTGGA CGTGCCGGAT CGCCTGCGCG TGCACATCGC GCAGTTCGCG
CGCGAGATCG GCGAGGAAGC GGCGGTCGGC GACCACATCG AGGCCAGCGT CGGCGGCATC
GGCTTCGTCT GCGCGGCGCC GGTTGTGGGC CAGGCGAAGG CGCTGCCGGC GTTCGCGCGG
GCGGTGCTGA CCCTGCTGCA TGTGCTGAGT GCGGCTCCGC CCTCCGCGAA CGGATTGGAC
CGCGCGCGGC TGGCCGACGA GCTGGCGGCG CTGCTGCATG GCCACGGCGG CTCGGCCACA
CGCCTGAGCG ATGCTGCGCT GGTGAAGCTG TTCCGTCTGC TGCGCCTGGC GCGCCGGCTG
CTGGATCTGG AAGCCGGCGT AGCGAGCCAG GATTCCTGA
 
Protein sequence
MAELTPQDMA AKLLATGFER SGPSAATLSD PIADTPMVVT LDQLRPYDHD PRVTRNPAYA 
EIKASIRERG LDAPPAITRR PGEAHYIIRN GGNTRLAILR ELWSETKEER FFRIACLFRP
WPARGEIVAL TGHLAENELR GGLSFIERAL GIEKAREFYE QESGQALSQS ELARRLTADG
YPVPQSHISR MNDAVRYLLP AIPTLLYGGL GRHQVDRLAV LRKACERTWE RRALGRTVAV
DFATLFQDVL TQFDTQPDDF SPQRVQDELV GQMAELLEAD YDTLALEIND SESRQRALTS
EPAAPTPPAA PVVPADPPPP VSAPQQPPAS SVPRDTTPVA PSAPAATPPA SPEAPEDQHG
EREERLQGHI VTPAPTTERL QSIQRMVADQ LGDKLPDFEA DALRAIPVQV GGLYPISDVW
YVEPGLDVPD RLRVHIAQFA REIGEEAAVG DHIEASVGGI GFVCAAPVVG QAKALPAFAR
AVLTLLHVLS AAPPSANGLD RARLADELAA LLHGHGGSAT RLSDAALVKL FRLLRLARRL
LDLEAGVASQ DS