Gene Tmz1t_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2800 
Symbol 
ID7873209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3030688 
End bp3032061 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content48% 
IMG OID643699722 
ProductAnkyrin 
Protein accessionYP_002889777 
Protein GI237653463 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATTC TGCTGCTGGG TTTCATCGGC TGGACTTTGT TGTGCTGCGC ATTGATCGCT 
AGAAAGGCAG GCTATTCAGG ATGGTGGTCT ATTTTAATGG TCATTCCAAT AATAAGCTTG
GTTTTGATGT GGCGTTTCCC GTTCTTGAAA TGGCCAGCGC TCAGGCGACA ACCGCAGCCT
GAATCCGTAC GTGAATCGTC CGCAAGAATG GCCAAGGGAG CTTCACAGCA CGATGGGGTG
ACGTACGTCG TACCTGCGAT TGGGGTTCAG TTCGAGGCAG CATCTCAAAA GAATGATGAG
GCTGTGCTTG ATGCCCTTTT AGAAGTTGAT CTCGTGCCTG CTGAGGTGAG TCTTGACAGG
ATTTACGAAG AAGTGGCTGA GGAACTAGAG AGGGAAAGGG TCGATAAAGG GCTGTGGACT
AGGCTGTACG CGGAATTTGA TGGCGATGAA AGAAAAGTAA AGGTTGGGTA CATCAAGGCC
CGTGCAGAAA AGCTGCTTCG TGAAAAAGGC GAAGAGATTC GGATTGCCCG CTTGCGTCAT
GAGGAGAAGC TAAGGGCTAT AAGCCAACTG AAAATGAAAC GAGATTATAT TCGTGAGAAT
ATTGATCGTG CTTCGTCAGA CGGTCGCGCA GATGCTGGCC TCGAGGGTCT TTCATCAACA
CACACTGCAA CGTTGTTTCT AAATTCAGTT AGGTTCAGTC GCATTGACGA GGCAAGGTCT
TGGCTTGATG AAAACCCAGC GTTAGTCGAC GTTAAAGATA GCGGGGGTAT GACTGCGCTT
CACATAGCCG CACGAGAAGG TTATGCGGAT ATGATTAAAT TTCTGATCCA GAGAGGAGCT
TCTCTGACGG CTAGAAATTT GGAAGGGAAG GTTCCGTTAG ATCTGTCTGC CGGGTTCGGT
GCTCAATGGA TAAATGAAGT GCTTGGATCA ACGCAAGTCC GCCAAAAAAG AGACAAAGAA
AATTCGGAAA ACAGTTCAGT GAGAAAGGGG GTTGATATAG CCCTTATGAA TGTCAGGGCC
TCTAGAAAGC TTCTTACTGA AGACGATATG ATCCGTGCGC TCCGAGAGAA AGGTTCAAGC
TTGGCAAAAA ATTTCTGGAG TGACGTAAAA GATGGGAATC ATGTATTCAT CAGCCGGGAA
CTAGATAGAA ATCCGTGGCT CGCAGCAGTT GCCTTCGATT ATGGCGAAAC GGCACTGCAC
AAAGCGGTAG GCCGTAAAGA TCTCTGGTTA ATTGAACATC TTCTAATTGC AGGTGCTATG
CCGGATAAGG CTGCGGATTA CGGGAAGTCG GCGCTTGACT TAGCGAGGGC GTCTGGGGAT
GGCGACATTG TCACGCTTCT AGAGTGCTGT TCTGAATTTG ATGCTAAGTC ATGA
 
Protein sequence
MDILLLGFIG WTLLCCALIA RKAGYSGWWS ILMVIPIISL VLMWRFPFLK WPALRRQPQP 
ESVRESSARM AKGASQHDGV TYVVPAIGVQ FEAASQKNDE AVLDALLEVD LVPAEVSLDR
IYEEVAEELE RERVDKGLWT RLYAEFDGDE RKVKVGYIKA RAEKLLREKG EEIRIARLRH
EEKLRAISQL KMKRDYIREN IDRASSDGRA DAGLEGLSST HTATLFLNSV RFSRIDEARS
WLDENPALVD VKDSGGMTAL HIAAREGYAD MIKFLIQRGA SLTARNLEGK VPLDLSAGFG
AQWINEVLGS TQVRQKRDKE NSENSSVRKG VDIALMNVRA SRKLLTEDDM IRALREKGSS
LAKNFWSDVK DGNHVFISRE LDRNPWLAAV AFDYGETALH KAVGRKDLWL IEHLLIAGAM
PDKAADYGKS ALDLARASGD GDIVTLLECC SEFDAKS