Gene Tmz1t_2222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2222 
Symbol 
ID7083654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2503120 
End bp2504457 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content71% 
IMG OID643699242 
Productpeptidase M48 Ste24p 
Protein accessionYP_002355858 
Protein GI217970624 
COG category[R] General function prediction only 
COG ID[COG4784] Putative Zn-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTCT TCGACGACCT CGCCCGCAGC CTCGAACGCA ACCGCATCAC CCGCCGCCAG 
GCGCTGTGGC TGCTCGGCGC GGGCGCGGCG GCCGGCCTGT CGGGCTGCGC CACCTCCCCG
GTCACCGGCG AGACCATCCT CGTCGGCATG AGCGAGGCGC AGGAGAAGCA GACCGACGCC
CAGGTCGCGC CGCACCAGTT CTCGCAGGAC CTCGGCGCCA TCCAGGACGA GGCGGTCAAC
CGCTACGTCG CCGGCATCGG CCAGCGCATG GGCACGCTCA CCCACCGTCC GCAGATGCCC
TACTCCTACC GCGTGCTCAA CGCCAACTAC GTCAACGCCT ACACCTTCCC CGGCGGCGCG
ATGGGCGTGA CGCGCGGCAT CCTCGCCGAC CTCGACGACG AGGCCCAGCT CGCCGCCCTG
CTCGGCCACG AGCTCGGCCA CGTCAACGCC CGCCACGCCG CCCAGCGCCA GGGCCAGAAC
CTGGTCGCGC AGGCGGCGCT CGCCGGGCTC AACGTGGCGG CACAGAGCTC CGACTGGGGC
GGGCTGATGA GCATGGGCGG ACAGATCGGC GCCAGCGCCC TGCTCGCCGG CTACTCGCGC
GAGCACGAGC GCGAGGCCGA TGCGCTCGGG CAGGAATATC TCGTCAAGGC CGGCTACCCG
GCGACCGGCA TGGTGCGCCT GCACCAGTTG CTGGTTGCCG AGGAAAAATC CGCCCCCTCG
CTGCTGCAGA CGATGTTCTC CACCCACCCG ATGAGCAGCG AGCGCATGCA GGCCGCGCAG
GCCGCGGCCG ACGCGCGCTA CCGCATCAGC AACAGCCTGG ACGCCCGCCG CGAGCGCTTC
ATGGACAGCA CCGCCAGCCT GCGCCGCATC CGCCCCACCA TCGACGCCTG CAAGAACGGC
GAAACCGCGA TGGCCGCCAG GCAGTACCCC AAGGCGCAGG CCGAATTCCA GACCGCGCTG
GCCAGGACCC CGCGCGACTA CGCCAGCAAC CTGCGCATGG CCCAGTGCCT GCAGGCCCAG
GGCCAGACCG CGAAGGCGGT GGACTACGCC GACAACGCGC GCGAGATCTA CCCGCAGGAG
GCGCAGGCCT ACAAGCTCGC CGGCGTGCTC GCCCTGCAGC AGCGCGACGC CGGCCGCGCC
TACCAGAACC TCGACCGCTT CGACCGCCTG CTCCCCGGCG ACGCCGGCAT CACCTTCCTG
AAGGGCATCT CGCTCGAAGG CATGGGCAAC CGCCAGGCCG CCGCCCAGCA CTACGCCGCC
TACCTGCGCC AGAGCCAGCA GGGCAACGCC GCGCAGTACT CGTACAACCG GCTCAAGGCC
TGGGGGATGG TGAAGTAG
 
Protein sequence
MSFFDDLARS LERNRITRRQ ALWLLGAGAA AGLSGCATSP VTGETILVGM SEAQEKQTDA 
QVAPHQFSQD LGAIQDEAVN RYVAGIGQRM GTLTHRPQMP YSYRVLNANY VNAYTFPGGA
MGVTRGILAD LDDEAQLAAL LGHELGHVNA RHAAQRQGQN LVAQAALAGL NVAAQSSDWG
GLMSMGGQIG ASALLAGYSR EHEREADALG QEYLVKAGYP ATGMVRLHQL LVAEEKSAPS
LLQTMFSTHP MSSERMQAAQ AAADARYRIS NSLDARRERF MDSTASLRRI RPTIDACKNG
ETAMAARQYP KAQAEFQTAL ARTPRDYASN LRMAQCLQAQ GQTAKAVDYA DNAREIYPQE
AQAYKLAGVL ALQQRDAGRA YQNLDRFDRL LPGDAGITFL KGISLEGMGN RQAAAQHYAA
YLRQSQQGNA AQYSYNRLKA WGMVK