Gene Tmz1t_1793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1793 
Symbol 
ID7085763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2013382 
End bp2015349 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content68% 
IMG OID643698815 
Productpeptidase U32 
Protein accessionYP_002355441 
Protein GI217970207 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCG ACCGCCACAC CCTCGAACTC CTCGCCCCCG CCAAGACCGC CGACTTCGGC 
ATCGAGGCCA TCGACCACGG CGCGGACGCG GTCTACATCG GCGGCCCGGC CTTCGGCGCG
CGCTCCTCCG CGGACAACTC GGTGGAGGAC ATCGCGCGCC TCGTGCAGCA CGCCCACCGC
TATCACGCCG AGGTCTTCGT CGCCACCAAC ACCATCCTCT TCGACCACGA GATCGAGCCG
GCGCGCAAGC TGATCTGGCA ACTGTACGAT GCCGGCGTCG ACGCGCTCAT CGTGCAGGAC
ATGGGCCTGC TCGAGCTCGA CCTGCCGCCG ATCCAGCTCC ACGCCAGCAC CCAGACCGAC
ATCCGCGACG CGAGCAAGGC GCGCTTCCTG CAGGACGTGG GCTTCTCGCA GATCGTGCTG
GCGCGCGAGC TGTCGTTGAA CGAGGTGAAG AAGATCGCCG CGGCCACCAC CTGCCAGCTC
GAATACTTCG TGCACGGCGC ACTGTGCGTG GCCTTCTCCG GCCAGTGCTA CATCAGCCAC
GCGCACACCG GGCGCAGCGC CAACCGCGGC GAGTGCTCGC AGGCCTGCCG GCTGCCCTAC
GACCTCAAGG ACAAGGACGG CCACACCGTC GCCAGCAACC AGCACATGCT GTCGATGAAG
GACAACAACC AGAGCGCCAA CTTGCGCGCG CTCGCCGCCG CGGGGGTAAG CTCGTTCAAG
ATCGAGGGCC GCTACAAGGA TCTCTCCTAC GTCAAGAACA TCACCGCGCA CTACCGCACG
CTGATCGACG AGATCCTCGA GCACCCGGAC ACCGACGGCG CCCGATATCG CCGCGCCTCC
AGCGGACGCA CCACCTTCTT CTTCACCCCG CAGGCCGACA AGACCTTCAA CCGCGGCTAC
ACGGACTACT TCACCAACGA TCGCCGCCAC GGCATCGAGG CGTTCGAATC GCCCAAGTTC
GTCGGCGAGC CCATCGGCCG CGTGAAGAAG ATCGACACCA AGGGGCGCAC GTTCTTCGAC
GTCGAGCGCG CGGCGCCGAT CCACAACGCC GACGGCCTGA CCTGGTACGA CCCCAAGGGC
GAGCTCACCG GCCTGCGGGT GAACCGCGCC GAGGCGGACG GCGGCGGCGA AGGCATCGAC
CGCATCTTCC CTGCCGACCC CCTGCCGACC GACCTGGTCC CCGGCACCTC GCTGTTCCGC
AACCATGACC ACGAATTCGA GCGCGCGCTG GGGAAGAAGT CGGCCGAGCG CCGCATCCGT
GTCGATGCAC GTTTCGCCGC CACCCACGAC GGCTTCGCGC TGACCCTGAC CGACGAGGAC
GGCGTCGCCG TCACCGCGAC GCTTGCCGCC GCCTTCGAGC CGGCACAGAA CGCCGAGCGT
GCACTCGCCA CCCTGCGCGA GCACCTCGGC AAGCTGGGCA ACACGATCTT CAGCGCAGGC
GAGCTCGTGC TCGACCTGCC CGCCGCGCCC TTCCTGCCCG CCGGACAGCT CAATGCGCTG
CGCCGCGATG CCGTGGAGCG GCTCGAGGCC GGCCGCCTCG CGGCCCACGC CCGCCCGCTG
CGGGCCGCGC CGGTCGAGCC GCCGGTGCCC TACCCGCAGG ACGCGCTGAG CTACCTCGCC
AACGTGTCGA ACGACAAGGC GCGCGCCTTT TACGCCCGCC ACGGCGTCAA GCTGATCGAC
GCCGCATACG AGGCCAACGA GGAGCGCGAC GACGTCTCGC TGATGATCAC CAAGCACTGC
CTGCGCTACA GCTTCAATCT GTGTCCGAAG GAGGTCAAGG GCATCCGCCC CGACCCGATG
CAGTTGGTCA ATGGCGACGA GACGCTGACG CTGAAGTTCG ACTGCAAGCG CTGCGAGATG
CACGTCATCG GCGCGCTGCG TCCCCACGTG GCGAAGATGC GCGACACCGT GGTGGCGCAC
AAGGTGAGCT TCGTACCCCA GCGCAAGACC AACGCCTCCG TGCGCTGA
 
Protein sequence
MSIDRHTLEL LAPAKTADFG IEAIDHGADA VYIGGPAFGA RSSADNSVED IARLVQHAHR 
YHAEVFVATN TILFDHEIEP ARKLIWQLYD AGVDALIVQD MGLLELDLPP IQLHASTQTD
IRDASKARFL QDVGFSQIVL ARELSLNEVK KIAAATTCQL EYFVHGALCV AFSGQCYISH
AHTGRSANRG ECSQACRLPY DLKDKDGHTV ASNQHMLSMK DNNQSANLRA LAAAGVSSFK
IEGRYKDLSY VKNITAHYRT LIDEILEHPD TDGARYRRAS SGRTTFFFTP QADKTFNRGY
TDYFTNDRRH GIEAFESPKF VGEPIGRVKK IDTKGRTFFD VERAAPIHNA DGLTWYDPKG
ELTGLRVNRA EADGGGEGID RIFPADPLPT DLVPGTSLFR NHDHEFERAL GKKSAERRIR
VDARFAATHD GFALTLTDED GVAVTATLAA AFEPAQNAER ALATLREHLG KLGNTIFSAG
ELVLDLPAAP FLPAGQLNAL RRDAVERLEA GRLAAHARPL RAAPVEPPVP YPQDALSYLA
NVSNDKARAF YARHGVKLID AAYEANEERD DVSLMITKHC LRYSFNLCPK EVKGIRPDPM
QLVNGDETLT LKFDCKRCEM HVIGALRPHV AKMRDTVVAH KVSFVPQRKT NASVR