Gene Tmz1t_3574 details

Gene Information

Locus tag: Tmz1t_3574
Symbol:
ID: 7873079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3917078
End bp: 3918220
Gene length: 1143 bp
Protein length: 380 aa
Translation table: 11
GC content: 72%
IMG OID: 643700514
Product: Peptidoglycan-binding domain 1 protein
Protein accession: YP_002890544
Protein GI: 237654230
COG category:
COG ID:
TIGRFAM ID:
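
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3917078
end_bp = 3918220

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1143 bp
print(protein_length, "aa")  # 380 aa
```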


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACGCA TCATTCCCTG CACGGATTGC GCCGAGAAGA CGCTCGACGT CGAGCAGCTC 
GGCTTCCGGG TCACCAGCTG CGATCCCCAT CCCGAGCGGC CCGGGTTCTG CGTGCTGCGC
TTCGAGGATC GCAGCGCCAC GCCGGCGGCC GGGGCAAGCC TCGCAGCCCC CGCCGCAGCG
GCGCGCGCGG GGAGGGGGGC GGTCGCCGGT GGCGTCACCG CCACCCAGGC GGCGGTCGCC
AAGGCGATCG TCAACCTGTT CGAGACCGGG GAGGTGCTGG GCCAGTACGG GAAGGTGACG
CTGATCCCGG GCGATACCGG CCACCTGACC TTCGGCCGAT CGCAGACCAC GCTCGGCTCT
GGCAACCTCG CCAAGCTGCT GCAGCAATAC TGCGCCAACC CCGGGGCACG CTTCGCCGGC
CGGCTGGCGT CCTACCTGCC GCGCTTCCTG GCCATCGACG AGAGCCTCGA CGACGATCCC
CGCCTGCACA ACGTGCTGCG TGCGACCGCC GACGATCCGG TGATGCGCGA TACGCAGGAT
GCGTTTTTCG ATCGGACCTA CTGGGAGCCT GCGCTGCGCG CGGCCGCAAG CTTGGGCGTG
CACACCCCGC TCGGCGTGGC GGTGGTGTAC GACAGCGCCG TGCACGGCTC CTGGCTGGCG
ATGCGCGACC GCACCACGCG CGCGGTCGGC GAGCCCGCGG CGGTGGGCGA GCAGGCCTGG
ATCGACGCCT ATGTGCGCAC GCGACGAGCC TGGCTGGAGG GCCACGCGCG CGCCGACCTG
CGCCAGACGG TGTATCGCAT GGAAGCGTTC GGACGCCTCA TCGACCAGGG CTTCTGGGGG
CTCGAGATGC CGCTCGTGGT GCGCGGCAGG GAGATCTCGA GCGTGACGCT CGCCGCCTTG
CCGCCCGGCT GCTACGACGG GCCGCAGCCG GGTTCGCGCC CCTTGACGCT GGCGACCCCG
CTGGCGCGCG GGCTGGATGT CCGCCTGCTG CAGCTCGGCC TGTCCGACCG CGGCGTGGAC
ATCCTCGCCG ATGGCATCTT CGGGCGGACC AGCTTCAACC TGCTCAAGGC CTGGCAGGCG
CAGCACGGGC TGGCGGCCAC CGGCATCGCG GACCCCGCCC TGATCGGCGA GTTGACGGCC
TGA
 
Protein sequence
MERIIPCTDC AEKTLDVEQL GFRVTSCDPH PERPGFCVLR FEDRSATPAA GASLAAPAAA 
ARAGRGAVAG GVTATQAAVA KAIVNLFETG EVLGQYGKVT LIPGDTGHLT FGRSQTTLGS
GNLAKLLQQY CANPGARFAG RLASYLPRFL AIDESLDDDP RLHNVLRATA DDPVMRDTQD
AFFDRTYWEP ALRAAASLGV HTPLGVAVVY DSAVHGSWLA MRDRTTRAVG EPAAVGEQAW
IDAYVRTRRA WLEGHARADL RQTVYRMEAF GRLIDQGFWG LEMPLVVRGR EISSVTLAAL
PPGCYDGPQP GSRPLTLATP LARGLDVRLL QLGLSDRGVD ILADGIFGRT SFNLLKAWQA
QHGLAATGIA DPALIGELTA
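
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal Python sketch, not the database's own method. It assumes the codon assignments of NCBI translation table 11, which are the same as the standard code (the tables differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Amino-acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base blocks and line breaks used in the record)
    is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGGAACGCA TCATTCCCTG CACGGATTGC"))  # MERIIPCTDC
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (with its spaces removed) gives a simple consistency check between the two blocks.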