Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1406 |
Symbol | |
ID | 6092848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1415365 |
End bp | 1416660 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642488608 |
Product | sun protein |
Protein accession | YP_001739433 |
Protein GI | 170289195 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAACGA ACGTTAGACT GCTCGCCTAC AGACTTCTGA GGAAGTACGA GAAGGAAAAA TTCATATCGA GAGAGGACGT GGACAGCGTC CTCTCTTTTT TGGACGACAG AGACAGAAGA TTTTTCAAAG AGCTCGTGTG GGGAGTTGTG AGGAAAGAAG AACTTCTGGA CTGGTACATA AACCAGCTTT TGAAGAAAAA GGACGTTCCA CCTGCTGTCA GGGTTGCGTT GAGGATGGGG GCTTATCAGC TCCTCTTCAT GAACAGCGTT CCGGATTACG CAGCAGTCAG CGAAACAGTA AAACTGGTGA AGAACGAGAA CTTCAAAAAA CTGGTGAACG CCGTGCTGAG GAGATTGAGG ACCGTCCCCG AACCAAAAGA ACTTCACCTC GTCTACTCAC ATCCGGAGTG GATCGTGAAC TACTGGAGAT CATTTCTTCC CGAAAAAGCG GTCCTGAGAA TAATGAAGTG GAATCAGGAA CCTCTCCCTG TCATGCTCCG TGTGAATTCG CTCGCAGCCA CTAAGGAGGA AGTCATCAAA ATCCTTGCTG AAGAGGGTAC GGAGGCAATC CCGGGAAAAC ACGCTCCGTT TTCCCTGATT GTGAGGAAAC TCGGCGTTCC AATGAACGAC TCCAGGGTGA TAAACGATGG GCTCGCGAGT GTTCAGGGAG AGTCTTCACA GCTTGTACCT TTCTTCATGG AGCTGAGACC CGGACTGAGA GTACTGGATA CCTGCGCGGC ACCGGGTGGT AAGACTACCG CCATCGCCGA ATTGATGAAG GATCAGGGGA AGATACTGGC CGTTGATATA AGCAGAGAGA AAATCCAGCT CGTCGAAAAA CACGCGAAAC GTCTGAAACT CTCTTCGATA GAGACAAAGA TCGCCGATGC GGAACGACTC ACAGAGTACG TCCAGGATAC ATTTGATAGG GTCCTGGTGG ATGCTCCCTG TACCTCGCTG GGCACGGCAA GAAACCATCC GGAAGTTCTG AGGAGAGTGA ACAAAGAGGA TTTCGAGAAG TTCTCGGAGA TTCAGCTGAG GATGGTACAG CAGGCCTGGC AGCTTCTGGA AAAGGGAGGA ATTCTCCTCT ACAGCACATG TACCGTGACA AAAGAAGAAA ACACTGAAGT GGTGAAAAGA TTCGTCTACG AACAGAAAGA CGCAGAAGTG ATCGATATCA GAGACAAGAT GAAAGAATTC GAAGTGGAAG GAATCTGGGA TGGTTACGGC TTTCTGATGC TTCCAGACGA GACGATAACT CCCTTCTACA TCTCCGTTCT CAGAAAGATG GGATGA
|
Protein sequence | MRTNVRLLAY RLLRKYEKEK FISREDVDSV LSFLDDRDRR FFKELVWGVV RKEELLDWYI NQLLKKKDVP PAVRVALRMG AYQLLFMNSV PDYAAVSETV KLVKNENFKK LVNAVLRRLR TVPEPKELHL VYSHPEWIVN YWRSFLPEKA VLRIMKWNQE PLPVMLRVNS LAATKEEVIK ILAEEGTEAI PGKHAPFSLI VRKLGVPMND SRVINDGLAS VQGESSQLVP FFMELRPGLR VLDTCAAPGG KTTAIAELMK DQGKILAVDI SREKIQLVEK HAKRLKLSSI ETKIADAERL TEYVQDTFDR VLVDAPCTSL GTARNHPEVL RRVNKEDFEK FSEIQLRMVQ QAWQLLEKGG ILLYSTCTVT KEENTEVVKR FVYEQKDAEV IDIRDKMKEF EVEGIWDGYG FLMLPDETIT PFYISVLRKM G
|
| |