Gene Tmz1t_2987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2987 
Symbol 
ID7874377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3240741 
End bp3241796 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content73% 
IMG OID643699908 
Productpeptidase U32 
Protein accessionYP_002889963 
Protein GI237653649 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.654873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCG TCGCCCCCAT CCGGCAACTC GACGAGATCG CCGCGCTGGC GCGCGCCGGG 
GCGGACGAAC TCTATTGCGG GGTGACGCCG CGCGAGTGGG CCGAGCGCTT CGGTGGCGCC
AGCGCCAACC GCCGTCCCGG AGGCAACCTC CCCTCGCTGG CCGCGCTCGC CGAGGCGGTG
GCGCTCGCCC ACCGCAATGG CGTACAGCTC TCCCTGGTGC TCAACGCACA GCAGTATTCG
GTCGAACAGA TCGAGTTCGC GCTCGCGATC GCGCACCGCT ACGTCGACAT GGGGGGCGAT
GCGGTGATCG CGAGCGACCC CGGCCTGCTC CTGGCGCTGG CCGAAGCCGA GCCGGAGCTG
CGCATCCACG TCAGCTCGGT GGCCACCTGC CGCAATGCCG ACGGCGCCCG CCTCTACCGC
GAGCTCGGCG CGCGCCGGCT GATCCTGCCG CGCGACATCA CCCTCGACGA GGCCGCGGAG
ATCGCCGCCG CAGTGCCCGA CCTGGAGATC GAGGCCTTCG TGCTCAACGA CGGCTGCGTC
TACGAGGAAG GCAGCTGCAA CACCCTGCAC CTCCCGGGCG CGCTCGGCGG GCCGATCTGC
CTGGACCGCT ACGCCTACGC GCACCGCCAC CGCGACGGCC GGCCGCTCTC GGCCGCGCTC
GCGGCCCGCC TGCAGGAGAA CGACGAGGCC TATCGGCGCT GGCTGTGGTA CCGCTTCTCC
TGCGGCTTCA CCACCACCGC CGACGGCCTG CCCTTCGGCC CCTGCGGCCT GTGCGCGATC
CCGGCGTTCG GGCGCGGCGG CATCCACGCG CTCAAGATCG CCGGCCGCGA GGGTCCGCCC
GAGCGCAAGC TCGCCAGCGT GCGCATGGTC CGGCGGATCC TCGACGCCCA CGACAACGGC
GAAGCCCCCG CGGCGGTGAT GGCCCGTGCG CGCAACCTGC GGCCTGCGCA CGAACACTGC
GCGACCGGCT TCATGTGCTA CTACCCGGAG GTCGTCTCCC GCGCATCCGA AGCGGCGCAG
CCGCTGTGCG ACGGTCGCGC AGCAGGTGCT CAGTAG
 
Protein sequence
MKIVAPIRQL DEIAALARAG ADELYCGVTP REWAERFGGA SANRRPGGNL PSLAALAEAV 
ALAHRNGVQL SLVLNAQQYS VEQIEFALAI AHRYVDMGGD AVIASDPGLL LALAEAEPEL
RIHVSSVATC RNADGARLYR ELGARRLILP RDITLDEAAE IAAAVPDLEI EAFVLNDGCV
YEEGSCNTLH LPGALGGPIC LDRYAYAHRH RDGRPLSAAL AARLQENDEA YRRWLWYRFS
CGFTTTADGL PFGPCGLCAI PAFGRGGIHA LKIAGREGPP ERKLASVRMV RRILDAHDNG
EAPAAVMARA RNLRPAHEHC ATGFMCYYPE VVSRASEAAQ PLCDGRAAGA Q