Gene Information |
Locus tag | TRQ2_0159 |
Symbol | |
ID | 6091561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 152318 |
End bp | 154231 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642487340 |
Product | hypothetical protein |
Protein accession | YP_001738203 |
Protein GI | 170287965 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
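The coordinate and length fields in this table can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 152318
end_bp = 154231
gene_length_bp = 1914
protein_length_aa = 637


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the derived values with the ones reported in the table.
print(computed_length == gene_length_bp)      # gene length consistent?
print(computed_protein == protein_length_aa)  # protein length consistent?
```

Under these assumptions both comparisons hold for this record: 154231 - 152318 + 1 = 1914 bp, and 1914 / 3 - 1 = 637 aa.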
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00185609 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTTTGA GAGAGATAAA CCGATACTGC AAAGAAAAAG CCACCGGAAA GAGAATCTAC GCAGTTCCAA AGCTGTGGAT ACCGAGTTTC TTCAAAAAGT TCGACGAAAA ATCCGGCAGG TGCTTCGTCG ATCCTTACGA ACTCGGAGCC GAGATCACCG ACTGGATTTT GAATCAGTCC AGAGAGCAGG ATTATTCCCA GCCTATTTCA TTTCTTAAGG GCGAGAAAAC ACCGGACTGG ATTAAGCGTT CCGTTGTTTA TGGATCCCTC CCCAGGACCA CCACGGCGTA CAATCACAAA GGCTCTGGAT ACTACGAAGA GAACGACGTT CTTGGTTTCA GAGAAGCGGG AACGTTCTTC AAGATGATGC TGCTTCTTCC GTTCATCAAA AGTCTCGGTG CGGACGCTAT CTATTTACTT CCCGTGAGTA GAATGAGCGA TCTCTTCAAG AAGGGAGACG CTCCATCACC GTACTCCGTG AAGAATCCAA TGGAGCTCGA TGAGAGGTAC CACGATCCGC TTCTCGAACC TTTCAAGGTG GATGAAGAGT TCAAGGCCTT TGTGGAAGCG TGTCACATCC TCGGAATCAG AGTGATTCTC GATTTCATTC CAAGAACGGC TTCCAGAGAC TCTGATCTCA TAAGAGAACA TCCGGACTGG TTCTACTGGA TAAAGGTGGA GGAACTTGCA GATTACACTC CTCCAAGGGC CGAGGAACTT CCGTTCAAGG TGCCGGATGA GGATGAACTC GAGATCATAT ACAGCAAAGA AAATGTGAAA AGACACCTCA AAAAGTTCAC ACTTCCTCCG AATCTGATCG ACCCTCAAAA GTGGGAGAAA ATAAAAAGAG AAGAGGGGAA CATTCTGGAG TTGATTGTGA AAGAATTTGG AATCATCACT CCTCCAGGAT TTTCCGATTT GATCAACGAT CCACAACCTA CATGGGATGA TGTCACGTTT TTGAGGTTGT ACTTGGATCA CCCGGAGGCT TCGAAAAGAT TTCTCGAGCC GAACCAGCCT CCCTACGTTC TCTACGACGT AATAAAGGCG AGCAAATTTC CTGGAAAAGA GCCGAACAGA GAGCTCTGGG AGTACCTCGC GGGCGTGATA CCACATTACC AGAAAAAATA CGGAATAGAC GGTGCAAGAC TCGATATGGG GCACGCACTT CCCAAAGAAC TTCTTGACCT CATAATAAAG AACGTGAAGG AGTACGATCC CGCATTTGCG ATGATCGCAG AGGAGCTGGA CATGGGGAAG GACAAAGTAT CGAAGGAAGC GGGATATGAC GTGATCCTGG GAAGTAGCTG GTACTTTGCG GGAAGAGTGG AGGAAATAGG AAAACTCCCT GAAATCGCCG AAAAGCTCGT TCTTCCTTTC CTCGCCTCCG TTGAGACTCC CGACACACCG CGCATTGCCA CAAGAAAGTA CGCTTCCAAG ATGAAAAAAC TGGCACCGTT TGTAACCTAC TTTCTACCGA ACTCTATTCC CTATGTGAAC ACGGGACAGG AGATTGGAGA GAAACAGCCC ATGAACCTGG GGCTGGACAC GGATCCAAAC CTGAGAAAAG TCCTCTCCCC AACCGACGAG TTTTTCGGGA AACTCGCATT TTTCGACCAC TACGTTCTCC ACTGGGACAG CCCGGACAGA GGAATCTTGA GCTTCATCAA AAAACTGATA AAGGTGCGCC AGCAGTTCCT CGATTTTGTC CTCAACGGAA AGTTTGAAAA CCTCACAACG GAAGATCTCG TCATGTACTC TTACGAGAGA AACGGACAAA AGATCATCGT CGCCGCAAAT 
GTTGGAAAAG AGCCAAAAGA GATCACCGGC GGAAGGGTTT GGAACGGAAA GTGGAGTGAT GAAGAGAAGG TAGTCCTCAA ACCCCTTGAT TTTGTTCTTG TTGTACAGGA GTGA
|
Protein sequence | MLLREINRYC KEKATGKRIY AVPKLWIPSF FKKFDEKSGR CFVDPYELGA EITDWILNQS REQDYSQPIS FLKGEKTPDW IKRSVVYGSL PRTTTAYNHK GSGYYEENDV LGFREAGTFF KMMLLLPFIK SLGADAIYLL PVSRMSDLFK KGDAPSPYSV KNPMELDERY HDPLLEPFKV DEEFKAFVEA CHILGIRVIL DFIPRTASRD SDLIREHPDW FYWIKVEELA DYTPPRAEEL PFKVPDEDEL EIIYSKENVK RHLKKFTLPP NLIDPQKWEK IKREEGNILE LIVKEFGIIT PPGFSDLIND PQPTWDDVTF LRLYLDHPEA SKRFLEPNQP PYVLYDVIKA SKFPGKEPNR ELWEYLAGVI PHYQKKYGID GARLDMGHAL PKELLDLIIK NVKEYDPAFA MIAEELDMGK DKVSKEAGYD VILGSSWYFA GRVEEIGKLP EIAEKLVLPF LASVETPDTP RIATRKYASK MKKLAPFVTY FLPNSIPYVN TGQEIGEKQP MNLGLDTDPN LRKVLSPTDE FFGKLAFFDH YVLHWDSPDR GILSFIKKLI KVRQQFLDFV LNGKFENLTT EDLVMYSYER NGQKIIVAAN VGKEPKEITG GRVWNGKWSD EEKVVLKPLD FVLVVQE
|
| |
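The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11, an alternative start codon such as GTG is read as methionine when it initiates translation. The sketch below illustrates this on the first 30 and last 15 nucleotides of the gene sequence only. It is an illustration under stated assumptions, not the pipeline that produced this record. The `translate` helper is hypothetical, and `START_CODONS` lists only the three most common bacterial initiation codons (table 11 permits several more).

```python
BASES = "TCAG"
# Amino-acid string for the standard code in TCAG order; table 11 uses the
# same codon-to-residue assignments and differs in its permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a common subset of table 11 initiation codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used above
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


gene_prefix = "GTGCTTTTGA GAGAGATAAA CCGATACTGC"  # first 30 nt of the gene sequence
gene_suffix = "GTTGTACAGGAGTGA"                   # last 15 nt, ends in stop codon TGA

print(translate(gene_prefix))                        # compare with protein start MLLREINRYC
print(translate(gene_suffix, first_is_start=False))  # compare with protein end VVQE
```

Since the record lists the gene on the minus strand and the sequence shown starts with a start codon and ends with a stop codon, it appears to be given in coding orientation already, so no reverse complement is applied here.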