Gene TRQ2_0349 details


Gene Information

Locus tag: TRQ2_0349
Symbol:
ID: 6091753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga sp. RQ2
Kingdom: Bacteria
Replicon accession: NC_010483
Strand:
Start bp: 331498
End bp: 333144
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 48%
IMG OID: 642487526
Product: putative manganese-dependent inorganic pyrophosphatase
Protein accession: YP_001738388
Protein GI: 170288150
COG category: [C] Energy production and conversion
COG ID: [COG1227] Inorganic pyrophosphatase/exopolyphosphatase
TIGRFAM ID:
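
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 331498
end_bp = 333144
gene_length_bp = 1647
protein_length_aa = 548

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1647 0 548
```

Under these assumptions the listed gene length and protein length agree with the coordinates.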


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.789821
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGCAT TGGAAAGAGT GTACGTGATA GGCCACAAGA ATCCGGACAC CGACAGCGTT 
TGCTCTGCGA TAGGGTACGC GCACTTCAAA AACAATGTGG AAAAGGGAAA AACATTCATT
CCAGCTCGAA GCGGCGATCT CACAAACGAA TCCCTCTTCG TGCTCAAATA TTTCGGAATG
AATCCACCCC TCCACATTGA AACGCTCGAA CCCACCGTTG AAGATCTCGA GCTGAAAAAT
CCCATTTTCG TCACTCCGGA CACATCCGCC TACGATGTGG CCATGCTCAT GGAAAGCAGA
GGAATAAAAA ACGTTCCGGT GGTTTCAAAG GAGAAAATGA TAGGAGTGGT CACTGAAAGC
AACATTGCTC GAGTCTACGT GAGGAGATTG AAGATAGAAC CATTGGTTAT ACACCCGGTT
CCGTTCGATC AGCTAGTCAG GATACTGAAA GCAGAGGTTG TCTGTGACTA CATGAAAGAA
AAAACCGTAT CCGGAAAGGT TCACATAGCG GTGGATGCCC TTCATGTGCT CCTGGGAAAG
ATAGAGATAG GTGACGTGGT GATCGTGGGA GACAACGAAC CGGCCCAGAT CGCTCTTCTG
GAAAAAGGAG CAAAACTCAT GATAGTCGTG AACAACGCCC CAGTATCAAA CAGAGTGCTT
GAAATAGCAA AAGAAAAGAA CGCCGCCGTT TTGAGAGTGA AGTTCGACGC GTTCGGCGCC
GCAAAGCTCA TAAACCTTTC ACTCCCCGTG ACCCTCGTGA TGAGCAAGAA ATTCCCCACG
GTGACGAAGA AGGACACACT CGAGGAAGTA AAAGAGATCG TCTTCACCTC GAAGATAAGA
GCAGCGTTCG TAGAAGACGA GAAAGGGCGG CTTTGTGGTG TTATAACTAG GACGGACCTG
CTCAAAGATG TGAGGAAAAA GGTGATCCTT GTGGATCACA ACGAGATCAC CCAGGCACCG
GAAGGAGTCG AGAAAGCGGA AATCCTCGAG ATCATAGACC ATCACAGGCT CGGTGGACTG
AGCACTCTGA ATCCCGTTTT CTTCTACAAC GAACCCGTCG GAAGCACCTC AACAATAGTG
GCAGAGTTCT TCTTGAAAAA CGGTGTGAAG ATGGAAAGGG AGATAGCCGG AATTCTGCTC
TCGGGTATCG TTTCCGATAC ACTCTTCTTC AAACTCTCCA CGACGACAGA GAAAGACAGG
AAGATGGCGA ATTTCCTGGC TGATGTTGCC AAACTCGATC TGGAAAAATT CGCGAAGAAA
CTGCTGAAGG AAGGGATGAA GATACCGGAA GACGTCGATC CCGCTGAACT GCTGAAGCGC
GACGTGAAAG TCTACGAGAT GGGAGAAGAA TCCTTCGCCG TTTCTCAGAT AATGACGTCG
GACTTTTCAA CACTTTTGAA AGAAAAGGAA CGTTTCACGA ACGCACTGAA GACCCTCAAG
GGAGAATTCG GTGTCAAACA TTTCTTTGTG CTCTTCACGA ATCCTGTGGA AGAAGCGAGT
CTTCTGATGA TGGATGGAGA TCAAAAGTTA GTGGAAAAAG CCTTCAACGC GGAAAAGAAG
GACGGTCTCT TTCTGTTGAA GGGGGTTATG TCCAGAAAGA AGGACTTCGT TCCCAAGATC
GGTGAGGTGC TGAGAAGGGA GAGATGA
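
A GC content figure like the one in the Gene Information fields can be recomputed from a sequence printed in this 10-base block layout. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the whole sequence; `gc_percent` is a hypothetical helper name, and the short input is a toy string rather than the gene itself.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    # Remove the spaces and line breaks used by the 10-base block layout.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# Toy input to show the call; paste the full gene sequence to check the record.
example = "GGCC AATT"
print(gc_percent(example))  # 50.0
```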
 
Protein sequence
MKALERVYVI GHKNPDTDSV CSAIGYAHFK NNVEKGKTFI PARSGDLTNE SLFVLKYFGM 
NPPLHIETLE PTVEDLELKN PIFVTPDTSA YDVAMLMESR GIKNVPVVSK EKMIGVVTES
NIARVYVRRL KIEPLVIHPV PFDQLVRILK AEVVCDYMKE KTVSGKVHIA VDALHVLLGK
IEIGDVVIVG DNEPAQIALL EKGAKLMIVV NNAPVSNRVL EIAKEKNAAV LRVKFDAFGA
AKLINLSLPV TLVMSKKFPT VTKKDTLEEV KEIVFTSKIR AAFVEDEKGR LCGVITRTDL
LKDVRKKVIL VDHNEITQAP EGVEKAEILE IIDHHRLGGL STLNPVFFYN EPVGSTSTIV
AEFFLKNGVK MEREIAGILL SGIVSDTLFF KLSTTTEKDR KMANFLADVA KLDLEKFAKK
LLKEGMKIPE DVDPAELLKR DVKVYEMGEE SFAVSQIMTS DFSTLLKEKE RFTNALKTLK
GEFGVKHFFV LFTNPVEEAS LLMMDGDQKL VEKAFNAEKK DGLFLLKGVM SRKKDFVPKI
GEVLRRER
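
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions: it uses the standard codon-to-amino-acid assignments (which translation table 11 shares), it emits M for the first codon when told the input begins at the initiator codon without checking that the codon is a permitted start, and it stops at the first stop codon. `translate_cds` and its `starts_at_initiator` parameter are hypothetical names, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments and differs in which codons may start translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate_cds(dna: str, starts_at_initiator: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if i == 0 and starts_at_initiator:
            # Assumption: the first codon is a valid initiator (here GTG),
            # which is read as methionine rather than valine.
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 27 bases of the gene sequence above.
head = "GTGAAAGCAT TGGAAAGAGT GTACGTGATA"
tail = "GGTGAGGTGC TGAGAAGGGA GAGATGA"
print(translate_cds(head))                             # MKALERVYVI
print(translate_cds(tail, starts_at_initiator=False))  # GEVLRRER
```

Both fragments match the corresponding ends of the listed protein sequence, with the GTG start codon read as M and the terminal TGA read as a stop.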