Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1808 |
Symbol | |
ID | 6093259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | - |
Start bp | 1830025 |
End bp | 1833051 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642489005 |
Product | hypothetical protein |
Protein accession | YP_001739822 |
Protein GI | 170289584 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0837285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAT TCCTTGAAAG TCTCAAATTC CCGGTCCAGG AAGTAAATAA AAAAAGTTCC GGCGAGAAAG GACCTGGAAG GCCTCCGTAC TGGGAGATGG TGTTTTACTG GACGAGGAAA CCTCTCGTAG GTGCAAGGTC TGTAATAGCG GGGGCACTGC TTCCCGAAAA CGTGGATGAA AATCTATTCA AAGCGGCTGT AAGACTTTCT TCGCCCACAC CCCACAGAGA GAACCCTCAA ATTCCCGCGG AATTTGCGAA GTACTTTGAA GGGAAAAAGT TGCTCGATCC CTTCGCTGGG TTCGGTTCGA TTCCCCTCGA AGGGCTGAGA CTCGGCCTTG ATGTCACGGC TGTTGAACTT CTACCAACAA CCTACATTTT TCTCAAGGCC GTTCTCGAAT ATCCAAAGAA GTTCGGGAAA TCCCTCGTGA AAGACGTTGA AAGGTGGGGT GAGTGGATAA CCGAGCAACT CAAGAATGAC CCTGAAATCA GGGAGCTTTA CGACGATGAC GTGGCCGTTT ACATCGGAAC GTGGGAAATC AGGTGCCCTC ACTGTGGGCG GTGGACTCCT GCGATAGGAA ATTTCTGGCT GGCAAGAGTT AAGGACGGCA AGGGCTACAA GAGGCTCGCC TACATGAAGC CTGAGAGGAA GGGCGATGAG ATTGAAATAA GGGTGATTGA CCTCAACGAA ATCCTGGACG ATATTTCTAA AGCGAATGTT GACGGCAATG AGATCATCTT CGAAGGAGAA AACTACGTGA AAACAGTAGA AGAAGCTGTA AGAGATGGAA AGCTGAAACA GAGTGATGTG AAGATAGATG GAAACACGGT TATCTTCGAA GTTCCTTCGG CGAATATCGA ATTAAGACGG AGCCAGCTCA CCTGTCTTAT GTGTGGAAAT GTCATAAAGT ACGCGGATGA AAATGGAAAT CACCACATGA AGCTGAAAAA CGGGGATTTT TATGTGAAGT TCGCGCTGAG AAAGTACCAC GAAGGTGATG AGCGCTTCGC AAGGCAGAGA CTGCTCGTGA AGGTGAAAGT CAAGGATGGA GATCTGATCT TTGAACCAGC GACAAAGGAA GACAGCGAAA AACTCTGGAA AGCGAAGGAG AAGGTCAGGG AGATGCTTGA GAAGGGAGAT CTGGACGTGC CGAGTGAAGC AATACCTCTT TACGAGAACC GCCGTATTAC TCCAATACTC AGTGCAGAAA AATGGTATCA GTTCTTCAAC CCCCGTCAGC TTCTCACCCT CATAAAGATC GTGAGGCTGA TAAGGGAAGT TGGCAGAAAG GTCGAGGAGG AAAAGATTGC CGAGGGCTGG AATAAAGAAA GGGCATTTGA GTATGCGGAG GCAGTGGCAA CGTATTTGAG CACGGCGATG TTGAAGTATG CGTACTACAA TTCAATCGTT ACCCGGTGGG ATTCTACTTG GTGGAAAATT GGGGAAACAA TGTCCACCCG TGGAATCGCA ATGAATTGGA ACTGGACAGA AAGTCCTTGG TTTAGTAGTT TTGGTGGGAT GATCAAAACA CTCCTTGCAA TATTAAGGGG TGTTAAATAC CTCACCTCCG CTCTCTCCTC CTCCCAGAGA ACCCTTGCGG ATTTCACAGA AAACTCCGTT AAAGTTCTCC AGGGCGACGC GACCTCGCTC AACCTTGGAG AGAAGTTCGA CGTCATCGTA ACTGATCCAC CCTATGCAGA TGATGTCCCA TACACAGAAC TTTCTGACTT CTACTACGTC TGGCTCAAAA GGGCTCTGAG CGACGTTGAG AACGGAAAAC TCATTCCAAG GTTCCACAAG GAAGCGTTTT TCAAAAGAAT CGGCCCCAAA TGGGTGGAAA TCAAGACCCA GTGGCAGGAA TTTGCGAAGA AAGAAGTTTC GACGAATCCG GGTCGTTTCA TGGAAGACGA AAACAAAAAG GAGAAAGCCG TCCAACACTT CGAAAACCTC TTCAGTCAGG CCTTCGTGGC CATGAGGGAG CACCTCAAGG ACGATGGGGT TCTTGTCACC TACTACGCTC ACACAGATCC GGGAAGCTGG ATAAACCTCA TTGAAGCCGG CTGGAGGCGA GCAAGACTTC AAATAACGAG GGCAATCCCT CTGACAACCG AATCGGAAAC GAGCATCGTG AGCAGGGGTA AAATGAGTCT CGACACCTCG ATCGTTGCTG TCTGGAGAAA ACAGAAAGAG GAAAAAACCG TTCAAATATC GACTCTGAAG GAAGAGATCG AGAGAAAAGC AAAGTCTTCG GCTCGTGAAT TCATAGAGTA TGGTTATGAA GGACTCGATC TGCTGTACGG TGTGATGGCG GCTGCCCTCG AGGAAGTCAC AAAGTACAGA GAAATTTCCT CCCTGAAAGG TCCTCTTACG ACAGAAGAAA TTTTGAACGA ATACGTATAT CCCGCTACTA TAAGAGGAAT TGTGAACGCG ATTGCGGAGA TCGAAGGAAC TGGAACACTC CATTCTGGCA CTGCCATGTT CTACACCGCT TACAAGATAC TATTTGGAAA CGCATCTTTG AGCGCGAACG ATATCGTCCT CCTCAGGCTC GCAACATCAA CAGATCCGAG TGAGTTGATC AGCAGTGGAG TTCTCAAAGA GAAGAGATCT TCAAGCAGTA AAGAATACAC GCTCTACACA CCGGATCTTC TCGGCAAGAA AGCTTTGGAC ACGAAGGAGT TCCAGAAGTT CCTTCATGAA AAGAAACTCG ATCCGGTAGA GCCAAAACCA AAAAACAGTG TGGATGTGCT TCAGCTTCTC GAGTACTACT CACTGCTTGG ACGCTCCAGA GTGAAAGAGG AGATAGAAAA ACTCAGAAAA ATGTGGGCTG GTGAGGTAGA AGAGGCCCTT TTTATCGCGA GGCTCGTGTC TGAGTACTAC GCGGAGATCT ACATCAGGAA AATAGATCCT GTCAGGAGGA TGAAAGAAGA GTTCGCTTCA GAGATAGACA GGGAGCTGGA AAAGGACGGA TTCCTCGAAG TCGTTTTGAT GAGAAGACTC CTGGGTTATG TAGGGGGTGC TGTGTGA
|
Protein sequence | MKTFLESLKF PVQEVNKKSS GEKGPGRPPY WEMVFYWTRK PLVGARSVIA GALLPENVDE NLFKAAVRLS SPTPHRENPQ IPAEFAKYFE GKKLLDPFAG FGSIPLEGLR LGLDVTAVEL LPTTYIFLKA VLEYPKKFGK SLVKDVERWG EWITEQLKND PEIRELYDDD VAVYIGTWEI RCPHCGRWTP AIGNFWLARV KDGKGYKRLA YMKPERKGDE IEIRVIDLNE ILDDISKANV DGNEIIFEGE NYVKTVEEAV RDGKLKQSDV KIDGNTVIFE VPSANIELRR SQLTCLMCGN VIKYADENGN HHMKLKNGDF YVKFALRKYH EGDERFARQR LLVKVKVKDG DLIFEPATKE DSEKLWKAKE KVREMLEKGD LDVPSEAIPL YENRRITPIL SAEKWYQFFN PRQLLTLIKI VRLIREVGRK VEEEKIAEGW NKERAFEYAE AVATYLSTAM LKYAYYNSIV TRWDSTWWKI GETMSTRGIA MNWNWTESPW FSSFGGMIKT LLAILRGVKY LTSALSSSQR TLADFTENSV KVLQGDATSL NLGEKFDVIV TDPPYADDVP YTELSDFYYV WLKRALSDVE NGKLIPRFHK EAFFKRIGPK WVEIKTQWQE FAKKEVSTNP GRFMEDENKK EKAVQHFENL FSQAFVAMRE HLKDDGVLVT YYAHTDPGSW INLIEAGWRR ARLQITRAIP LTTESETSIV SRGKMSLDTS IVAVWRKQKE EKTVQISTLK EEIERKAKSS AREFIEYGYE GLDLLYGVMA AALEEVTKYR EISSLKGPLT TEEILNEYVY PATIRGIVNA IAEIEGTGTL HSGTAMFYTA YKILFGNASL SANDIVLLRL ATSTDPSELI SSGVLKEKRS SSSKEYTLYT PDLLGKKALD TKEFQKFLHE KKLDPVEPKP KNSVDVLQLL EYYSLLGRSR VKEEIEKLRK MWAGEVEEAL FIARLVSEYY AEIYIRKIDP VRRMKEEFAS EIDRELEKDG FLEVVLMRRL LGYVGGAV
|
| |