Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1758 |
Symbol | |
ID | 6093209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1774886 |
End bp | 1775881 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642488955 |
Product | peptidase M42 family protein |
Protein accession | YP_001739772 |
Protein GI | 170289534 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGAAC TGATCAGGAA GCTGACGGAA GCCTTTGGCC CAAGTGGACG GGAAGAAGAG GTGAGAAGAA TCATCCTCGA GGAACTCGAA GGGCACATAG ATGGCCACAG GATCGATGGG CTCGGCAATC TCATTGTTTG GAAAGGAAGC GGCGAGAAAA AGGTGATACT GGACGCTCAC ATAGATGAGA TAGGTGTTGT CGTCACAAAT GTGGACGACA AGGGATTTCT GACGATAGAA CCCGTCGGCG GTGTCTCTCC GTACATGCTT CTTGGAAAAA GGATCAGGTT CGAAAACGGT ACAATAGGCG TTGTTGGTAT GGAAGGTGAA ACAACAGAAG AAAGGCAGGA GAATGTGAGA AAGCTCTCAT TCGACAGGCT GTTCGTCGAT ATCGGTGCAA ATTCCAGGGA AGAAGCGCAG AAGATGTGTC CGATTGGAAG CTTCGGTGTC TACGACAGTG GATTCGTTGA GGTTTCCGGG AAATACGTCT CGAAGGCGAT GGATGACAGG ATAGGATGTG CCGTGATCGT GGAAGTTTTC AAAAGAATCA AACCCGCTGT TACGCTCTAC GGTGTTTTCA GTGTTCAGGA AGAAGTGGGA CTGGTCGGTG CCTCGGTAGC GGGGTACGGC GTACCAGCGG ACGAGGCCAT CGCGATCGAT GTGACTGATT CGGCAGACAC TCCGAAGGCC ATCAAGAGAC ACGCAATGAG GCTCTCCGGT GGACCCGCTC TGAAAGTGAA AGACAGGGCA TCGATCAGCA GCAAACGCAT CCTCGAAAAT TTGATAGAAA TCGCGGAAAA ATTCGATATA AAGTATCAGA TGGAGGTTCT GACGTTCGGC GGTACGAACG CCATGGGGTA CCAGCGGACT AAAGAAGGAA TTCCTTCGGC CACGGTGTCT ATTCCCACAC GATACGTTCA CTCACCCAGT GAGATGATCG CACCAGATGA CGTTGAGGCA ACGGTCGATC TTCTCATCAG GTATCTGGGG GCGTGA
|
Protein sequence | MKELIRKLTE AFGPSGREEE VRRIILEELE GHIDGHRIDG LGNLIVWKGS GEKKVILDAH IDEIGVVVTN VDDKGFLTIE PVGGVSPYML LGKRIRFENG TIGVVGMEGE TTEERQENVR KLSFDRLFVD IGANSREEAQ KMCPIGSFGV YDSGFVEVSG KYVSKAMDDR IGCAVIVEVF KRIKPAVTLY GVFSVQEEVG LVGASVAGYG VPADEAIAID VTDSADTPKA IKRHAMRLSG GPALKVKDRA SISSKRILEN LIEIAEKFDI KYQMEVLTFG GTNAMGYQRT KEGIPSATVS IPTRYVHSPS EMIAPDDVEA TVDLLIRYLG A
|
| |