Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpet_1701 |
Symbol | |
ID | 5171439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga petrophila RKU-1 |
Kingdom | Bacteria |
Replicon accession | NC_009486 |
Strand | - |
Start bp | 1700689 |
End bp | 1701684 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640564227 |
Product | peptidase M42 family protein |
Protein accession | YP_001245282 |
Protein GI | 148270822 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGAAC TGATCAGAAA GCTGACGGAA GCCTTCGGTC CGAGTGGACG GGAAGAAGAG GTGAGAAGAA TCATCCTCGA AGAACTCGAA GGGTACATAG ATGGTCACAG AATCGATGGG CTCGGCAATC TCATAGTTTG GAAAGGAAGC GGTGAGAAGA AGGTGATACT GGACGCTCAC ATAGATGAGA TAGGTGTTGT TGTTACAAAC ATAGATGAAA AAGGATTTCT GACGATAGAA CCCGTCGGTG GTGTCTCTCC GTACATGCTT CTTGGAAAAA GGATCAGGTT CGAAAACGGT GTGGTAGGCG TTGTTGGTAT GGAAGGTGAA ACAACAGAAG AAAGGCAGGA GAATGTGAGA AAGCTCTCGT TCGACAAGCT GTTCGTTGAC ATCGGTGCAA GTTCCCGGGA GGAAGCACAG AAGATGTGCC CGATTGGAAG TTTCGGCGTC TACGACAGTG GATTCGTTGA AGTTTCCGGA AAATACGTCT CGAAGGCGAT GGACGACAGG ATAGGATGCG CCGTAATCGT CGAAGTTTTC AAAAGAATCA AACCCACTGT TACGCTCTAC GGTGTTTTCA GTGTTCAGGA AGAAGTGGGA TTGGTTGGTG CCTCGGTGGC AGGCTACGGC ATATCAGCGG ATGAGGCCAT TGCGATCGAT GTGACAGATT CGGCAGATAC ACCGAAGGCC ATAAAGAGAC ACGCTATGAG ACTCTCCGGT GGTCCCGCTT TGAAAGTCAA AGACAGAGCC TCGATCAGCA GCAGACGCAT CCTCGAAAAT CTGATAGAGA TCGCGGAAAA GTTCAGTATA AAGTATCAGA TGGAAGTTCT GACGTTCGGT GGCACGAACG CCATGGGATA TCAACGAACA AGAGAAGGAA TTCCTTCCGC CACGGTGTCT GTTCCCACAC GATACGTTCA TTCACCCAGT GAGATGATCG CACCGGATGA TGTTGAGGCA ACGGTCGATC TTCTCATCAG GTATCTGGGG GCGTGA
|
Protein sequence | MKELIRKLTE AFGPSGREEE VRRIILEELE GYIDGHRIDG LGNLIVWKGS GEKKVILDAH IDEIGVVVTN IDEKGFLTIE PVGGVSPYML LGKRIRFENG VVGVVGMEGE TTEERQENVR KLSFDKLFVD IGASSREEAQ KMCPIGSFGV YDSGFVEVSG KYVSKAMDDR IGCAVIVEVF KRIKPTVTLY GVFSVQEEVG LVGASVAGYG ISADEAIAID VTDSADTPKA IKRHAMRLSG GPALKVKDRA SISSRRILEN LIEIAEKFSI KYQMEVLTFG GTNAMGYQRT REGIPSATVS VPTRYVHSPS EMIAPDDVEA TVDLLIRYLG A
|
| |