Gene Tmz1t_2835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2835 
Symbol 
ID7873243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3072330 
End bp3073343 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content76% 
IMG OID643699756 
Productproline iminopeptidase 
Protein accessionYP_002889811 
Protein GI237653497 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCCG CCCCGGCGCC CTCGCGCGGC GCGCCGCCGC TGTGGGCCGA GGCGGCGCCG 
TTCTGCACGC ACCGCCTCGC GGTGGGGAGC GGCCACGTGC TGCACGTGGA GGAATGCGGC
CGCGCCGACG GCATCCCGGT GGTCTTCCTG CACGGTGGCC CGGGCAGCGG CTGCAGCCCA
CGCCAGCGCG GGCTGTTCGA CCCCGCACGA TTTCGCGCCG TGCTCTTCGA CCAGCGCGGC
GGCGGTCGCA GCACGCCGCT CGGCGGCCTG CGCGCCAATA CCACCGCCCA CCTCGTCGCC
GACATCGAGC GCATCCGCAC AGCGCTGGCG ATCGAGCGCT GGATCGTGTT CGGCGGCTCC
TGGGGCAGCC TGCTCGCGCT CGAATACGCC GGCGCGCACC CGCATCGCGT CGCGGGTCTG
GTGCTGCGCG GCATCTTCCT CGGCTCGCCG GCGGAACTGC GTACCTACAC CGAGCGCATT
CCTCCACGCG CACCCGGGCT GCGCCAGCGC CTCGCGGAGG AAGCGCTCAT CCGCTTGCCC
CGGGCCCGAT CCCGGCACGC GGAAGACGAT CTGCTCGCCA CCTGGTGCCG CCGCATGCTC
GCCGGCCGCC CCGAGACGAG GTGCGCCGCC GCGCGCCACT GGCTGGACCA CGAGCGCGCG
CTGATGGGCG AGCCGCCGCT CGCCGCCCCG CCCGACGCCC GCGAACTCGC CAAGGCGCGC
ATCCAGGCGC ATTACCTCGC CCACGGCTGC TTCACCGACG CCGCACGCCT GCTCGCCACC
TGCGCGGCCT TGCGCCACCT GCCGGCGGCG ATCGTGCATG GCGCCGACGA TCCGGTGTGC
CCGCCCGCCA CTGCGCGCGC GCTGCACCGC GCATGGCCGG CGGCGGAATA CACCGAGGTC
ACCGGCGCGG GCCACTCCGG GCTGGATGCG GCGATCGCCG CCGCCTGCGT CGCCGCACTC
GACCGTGTCG CAGAGTGCGC CCACCGCGGC GCCCACCCCC GCCGCAGCCG CTAA
 
Protein sequence
MDAAPAPSRG APPLWAEAAP FCTHRLAVGS GHVLHVEECG RADGIPVVFL HGGPGSGCSP 
RQRGLFDPAR FRAVLFDQRG GGRSTPLGGL RANTTAHLVA DIERIRTALA IERWIVFGGS
WGSLLALEYA GAHPHRVAGL VLRGIFLGSP AELRTYTERI PPRAPGLRQR LAEEALIRLP
RARSRHAEDD LLATWCRRML AGRPETRCAA ARHWLDHERA LMGEPPLAAP PDARELAKAR
IQAHYLAHGC FTDAARLLAT CAALRHLPAA IVHGADDPVC PPATARALHR AWPAAEYTEV
TGAGHSGLDA AIAAACVAAL DRVAECAHRG AHPRRSR