Gene Tmz1t_1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1993 
Symbol 
ID7083748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2255495 
End bp2256781 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content72% 
IMG OID643699018 
ProductSte24 endopeptidase 
Protein accessionYP_002355640 
Protein GI217970406 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0368513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGATC CCGCCCCATT CATGCTGTCG CCCTTCGGCC TGCCGCCTCT TTCCGCCCTG 
TTCCTGGCCT TCCTGGTCGC CGGTACCGTG CTCGGCCTCG GTCTGCTGCA TCGCCACGCC
CACCACGTGC GCCGCCATCG CGACGCGGTG CCGCAGCCCT TCGCCGGCTC GATTCCCCTG
CATTCGCACC AGCGTGCCGC CGACTACACC GTCGCGCGCG CGCGCCTGTC CGCCTTCCAC
GCCGCGGCCA ACGCCGGCTT CGTGCTCGCG CTCACGCTCG GCGGCGGGCT GCAGGCGATG
CACGACGCCT GGGCGGACGT GCTGCCCGCC GGCGGGCTCG CCCACGGCGT CGCGCTGCTC
GCCAGCCTGG GCGTGCTCGG CTGGCTGTTC GAACTACCCT TCGCGCTGCT GCGCACCTTC
GGCATCGAGA GGACCTTCGG CTTCAACCGC ATGACGCCGC GCCTCTACCT CGCCGACACC
GTGCGCGAGG CCGCGCTCGC CGCGCTGATC GGGCTGCCGC TGCTCGCCGC GGTGCTGTGG
CTGACGCTGG CGACGGGCGC GCTGTGGTGG GCCTGGGTGT GGGCGTTCTG GCTCGGCTTC
AACCTGCTCG CGATGGTGAT CTGGCCGACC TTCATCGCGC CGCTGTTCAA CAAGTTCACC
CCGCTCGCCG ACGCCACGCT GAAGGCGCGC GTCGAGGCCC TGCTCGCGCG CTGCGGCTTT
CGCGCCAAGG GCCTGTTCGT GATGGACGGC TCGCGCCGCT CGGCACACGG CAACGCCTAC
TTCACCGGGC TGGGCGCGGC CAAGCGCATC GTGTTCTTCG ACACCCTGCT CGACAAGCTC
GATGCCGACG AGGTCGAGGC GGTGCTCGCG CACGAGCTCG GCCACTTCCA CCACCGCCAC
CTGCTGCGCC GGCTGGCGGT GCTCGCCCCG GCCAGCCTGG GCGTGCTCGC CTTGCTCGGC
TGGCTCGCCC AGCAGCCCTG GTTCTTCTCC GGGCTGGGCA TGCAAAGCGC CGACCTGGCG
AGCGCGCTCG CCCTGTTCAC GCTGGTACTG CCGGTGTTCA GCTTCCCGCT CGCGCCGCTC
GCGAGCCACT GGTCGCGCAA GCACGAGTTC GAGGCCGACG CCTACGCCGC CCGCCAGGCC
GACGCCGGCA AGCTGGTGAG CGCGCTGGTC AAGCTCTACC GCGACAACGC CTCCACGCTG
ACGCCCGACC CGCTGTACTC GCGCTTCCAC GACTCGCATC CGCCCGCCGC GCTGCGCATC
GCGCGCCTGC AGGCGCTGCA ACGGTGA
 
Protein sequence
MTDPAPFMLS PFGLPPLSAL FLAFLVAGTV LGLGLLHRHA HHVRRHRDAV PQPFAGSIPL 
HSHQRAADYT VARARLSAFH AAANAGFVLA LTLGGGLQAM HDAWADVLPA GGLAHGVALL
ASLGVLGWLF ELPFALLRTF GIERTFGFNR MTPRLYLADT VREAALAALI GLPLLAAVLW
LTLATGALWW AWVWAFWLGF NLLAMVIWPT FIAPLFNKFT PLADATLKAR VEALLARCGF
RAKGLFVMDG SRRSAHGNAY FTGLGAAKRI VFFDTLLDKL DADEVEAVLA HELGHFHHRH
LLRRLAVLAP ASLGVLALLG WLAQQPWFFS GLGMQSADLA SALALFTLVL PVFSFPLAPL
ASHWSRKHEF EADAYAARQA DAGKLVSALV KLYRDNASTL TPDPLYSRFH DSHPPAALRI
ARLQALQR