Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2835 |
Symbol | |
ID | 7873243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3072330 |
End bp | 3073343 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 643699756 |
Product | proline iminopeptidase |
Protein accession | YP_002889811 |
Protein GI | 237653497 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGCCG CCCCGGCGCC CTCGCGCGGC GCGCCGCCGC TGTGGGCCGA GGCGGCGCCG TTCTGCACGC ACCGCCTCGC GGTGGGGAGC GGCCACGTGC TGCACGTGGA GGAATGCGGC CGCGCCGACG GCATCCCGGT GGTCTTCCTG CACGGTGGCC CGGGCAGCGG CTGCAGCCCA CGCCAGCGCG GGCTGTTCGA CCCCGCACGA TTTCGCGCCG TGCTCTTCGA CCAGCGCGGC GGCGGTCGCA GCACGCCGCT CGGCGGCCTG CGCGCCAATA CCACCGCCCA CCTCGTCGCC GACATCGAGC GCATCCGCAC AGCGCTGGCG ATCGAGCGCT GGATCGTGTT CGGCGGCTCC TGGGGCAGCC TGCTCGCGCT CGAATACGCC GGCGCGCACC CGCATCGCGT CGCGGGTCTG GTGCTGCGCG GCATCTTCCT CGGCTCGCCG GCGGAACTGC GTACCTACAC CGAGCGCATT CCTCCACGCG CACCCGGGCT GCGCCAGCGC CTCGCGGAGG AAGCGCTCAT CCGCTTGCCC CGGGCCCGAT CCCGGCACGC GGAAGACGAT CTGCTCGCCA CCTGGTGCCG CCGCATGCTC GCCGGCCGCC CCGAGACGAG GTGCGCCGCC GCGCGCCACT GGCTGGACCA CGAGCGCGCG CTGATGGGCG AGCCGCCGCT CGCCGCCCCG CCCGACGCCC GCGAACTCGC CAAGGCGCGC ATCCAGGCGC ATTACCTCGC CCACGGCTGC TTCACCGACG CCGCACGCCT GCTCGCCACC TGCGCGGCCT TGCGCCACCT GCCGGCGGCG ATCGTGCATG GCGCCGACGA TCCGGTGTGC CCGCCCGCCA CTGCGCGCGC GCTGCACCGC GCATGGCCGG CGGCGGAATA CACCGAGGTC ACCGGCGCGG GCCACTCCGG GCTGGATGCG GCGATCGCCG CCGCCTGCGT CGCCGCACTC GACCGTGTCG CAGAGTGCGC CCACCGCGGC GCCCACCCCC GCCGCAGCCG CTAA
|
Protein sequence | MDAAPAPSRG APPLWAEAAP FCTHRLAVGS GHVLHVEECG RADGIPVVFL HGGPGSGCSP RQRGLFDPAR FRAVLFDQRG GGRSTPLGGL RANTTAHLVA DIERIRTALA IERWIVFGGS WGSLLALEYA GAHPHRVAGL VLRGIFLGSP AELRTYTERI PPRAPGLRQR LAEEALIRLP RARSRHAEDD LLATWCRRML AGRPETRCAA ARHWLDHERA LMGEPPLAAP PDARELAKAR IQAHYLAHGC FTDAARLLAT CAALRHLPAA IVHGADDPVC PPATARALHR AWPAAEYTEV TGAGHSGLDA AIAAACVAAL DRVAECAHRG AHPRRSR
|
| |