Gene Tpet_0331 details


Gene Information

Locus tag: Tpet_0331
Symbol:
ID: 5171583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 311454
End bp: 313091
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 47%
IMG OID: 640562834
Product: putative manganese-dependent inorganic pyrophosphatase
Protein accession: YP_001243936
Protein GI: 148269476
COG category: [C] Energy production and conversion
COG ID: [COG1227] Inorganic pyrophosphatase/exopolyphosphatase
TIGRFAM ID:
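
The coordinates and lengths listed above are consistent with each other, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Tpet_0331.
length = gene_length_bp(311454, 313091)
print(length, protein_length_aa(length))  # prints "1638 545"
```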


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00550254
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAAAGAG TGTACGTGAT AGGACACAAA AATCCGGATA CCGACAGTGT CTGCTCTGCG 
ATAGGGTACG CGCACTTCAA AAACAATGTG GAAAAGGGAA AAACATTCAT TCCAGCTCGA
AGTGGTGAAC TCACGAACGA GTCTCTCTTC GTACTCAAAT ATTTTGGAGT AAAGCCTCCC
GTTCTCATTG AAACGCTCGA ACCTACCGTT GAAGATCTCG AGCTGAAAAA CCCCATTTTT
GTCTCTCCGA ACACTCCGGT TTACGACGTG GCCATGCTCA TGGAAAGCAA AGGAATAAAA
AACGTTCCGG TGGTTTCAAA GGAGAAAATG ATAGGAGTGG TCACTGAAAG CAACATTGCT
CGAGTCTACG TGAGGAGATT GAAGATAGAA CCATTGGTTA TACACCCGGT TCCGTTCGAT
CAGCTAGTAA GGATACTGAA AGCAGAGGTC GTGTGTGACC ACATGAAAGA AAAAATCGTG
GCTGGAAAGG TTCACATAGC GGTGGATGCC CTTCATGTGC TCCTGGGAAA GATAGAGATA
GGTGACGTGG TGATCGTGGG AGACAACGAA CCGGCCCAGA TCGCTCTTCT GGAAAAAGGA
GCAAAACTCA TGATAGTCGT GAACAACGCC CCAGTATCAA ACAGGGTGCT TGAAATAGCA
AAAGAAAAGA ACGCCGCCGT TTTGAGAGTG AAGTTCGACG CGTTCGGCGC CGCAAAGCTC
ATAAACCTTT CACTCCCCGT GACCCTCGTG ATGAGCAAGA AATTCCCCAC GGTGACGAAG
AAGGACACAC TCGAAGAAGT AAAAGAGATC GTCTTCAACT CGAAGATAAG AGCAGCGTTC
GTAGAAGACG AGAAAGGGCG GCTTTGTGGT GTTATAACTA GGACGGACCT GCTCAAAGAT
GTGAGGAAAA AAGTGATCCT CGTGGATCAC AACGAGATCA CCCAGGCACC GGAAGGAGTC
GAGAAAGCGG AAATCCTCGA GATCATAGAC CATCACAGGC TCGGTGGACT GAGCACTCTG
AATCCCGTTT TCTTCTACAA CGAACCCGTC GGAAGCACCT CAACAATAGT TGCAGAGTTC
TTCTTGAAAA ACGGTGTGAA GATGGAAAGG GAGATAGCCG GAATTCTGCT CTCGGGCATC
GTTTCCGATA CACTCTTCTT CAAACTCTCC ACGACGACAG AGAAAGACAG GAAGATGGCG
AATCTCCTGG CTGATGTTGC CAAACTCGAT CTGGAAAAAT TTGCGAAGAA ACTGTTGAAA
GAAGGGATGA AGATACCGGA AGACGTCGAT CCCGCTGAAC TGCTGAAGCG CGACGTGAAA
GTCTACGAGA TGGAAGAAGA ATCCTTCGCC GTTTCTCAGA TAATGACGTC GGACTTTTCA
ACACTTTTGA AAGAAAAGGA ACGTTTCACG AACGCACTGA AGACCCTCAA GGGAGAATTC
GGTGTCAAAC ATTTCTTTGT GCTCTTCACG AATCCTGTGG AAGAAGCGAG TCTTCTGATG
ATGGATGGAG ATCAAAAGTT AGTGGAAAAA GCCTTCAACG CGGAAAAGAA GGACGGTCTC
TTTCTGTTGA AGGGGGTTAT GTCCAGAAAG AAGGACTTCG TTCCCAAGAT CGGTGAGGTG
CTGAGAAGGG AGAGATGA
 
Protein sequence
MERVYVIGHK NPDTDSVCSA IGYAHFKNNV EKGKTFIPAR SGELTNESLF VLKYFGVKPP 
VLIETLEPTV EDLELKNPIF VSPNTPVYDV AMLMESKGIK NVPVVSKEKM IGVVTESNIA
RVYVRRLKIE PLVIHPVPFD QLVRILKAEV VCDHMKEKIV AGKVHIAVDA LHVLLGKIEI
GDVVIVGDNE PAQIALLEKG AKLMIVVNNA PVSNRVLEIA KEKNAAVLRV KFDAFGAAKL
INLSLPVTLV MSKKFPTVTK KDTLEEVKEI VFNSKIRAAF VEDEKGRLCG VITRTDLLKD
VRKKVILVDH NEITQAPEGV EKAEILEIID HHRLGGLSTL NPVFFYNEPV GSTSTIVAEF
FLKNGVKMER EIAGILLSGI VSDTLFFKLS TTTEKDRKMA NLLADVAKLD LEKFAKKLLK
EGMKIPEDVD PAELLKRDVK VYEMEEESFA VSQIMTSDFS TLLKEKERFT NALKTLKGEF
GVKHFFVLFT NPVEEASLLM MDGDQKLVEK AFNAEKKDGL FLLKGVMSRK KDFVPKIGEV
LRRER
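
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch, assuming NCBI translation table 11 (the table named in the record). Under that table the amino acid assignments match the standard code, and an alternative start codon such as the TTG that opens this gene is read as methionine. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, and the sketch ignores ambiguity codes and strand handling.

```python
from itertools import product

# Codon table built in the NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as printed in the record.
print(translate_cds("TTGGAAAGAG TGTACGTGAT AGGACACAAA"))  # prints "MERVYVIGHK"
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow a comparison with the protein sequence and GC content given in the record.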