Gene Tpet_0940 details

Gene Information

Locus tag: Tpet_0940
Symbol:
ID: 5170273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 960757
End bp: 962280
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 45%
IMG OID: 640563458
Product: phosphodiesterase
Protein accession: YP_001244534
Protein GI: 148270074
COG category: [R] General function prediction only
COG ID: [COG1418] Predicted HD superfamily hydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR03319] conserved hypothetical protein YmdA/YtgF
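
The coordinate and length fields in this record agree with each other, which a short sketch can confirm. It assumes 1-based, inclusive coordinates (a common convention for records of this kind, though not stated here) and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(960757, 962280)
print(length_bp, protein_length(length_bp))  # prints: 1524 507
```

Both values match the Gene Length and Protein Length fields of the record.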


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000513456
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTGGT ACATAGTAGC AGGAGCCGGT GGTCTTCTGA TTGGATACTT AATAGCAAGT 
TATCAGATAA ATCAAAAGCT CAGAAAAGCG AAAGAAGACG CTCAAACCAT AATCGAGAAA
GCAGAAAAAG AAGCAAACGA GATAAAAAAG AAAGCGATAA TAGAAGGAAG AGAGGAAGTT
CACAGGCTCA GAGAGGAATT CGAGAAAGAG CGATCCAGAA GAGAAGAAGA ATTGAGAGCG
TTTGAGGAAA GAATTTTAAA GAGAGAAGAA CTCCTGACGA GGAAAGAGGA AAACCTTGAA
AAAAGGGAAC ATCAGATTGA GGAACTCAAA GCAAATCTGG AAGAGAAAAT GAGAGAGGTC
GAAGAAAAGG AAAAAAGAAT CGACGAAGAA TTGAAGCGTC TTGCCGGTAT GACTGTGGAA
GAAGCAAGAG AGCTCATTCT CGAAGAAGCC AGGCAGAGGT ACGAACACGA CCTTGCGAAA
CTCTACAAAG AGATGAAAGA ACAGGTTGAA GAAGAAGTCG AAAAGGAAGC CAAGAAAGTC
ATAGCATTCG CTGTTCAGAG ATACGCACCG GATTATGTAG GAGAAATCAC CGTTTCTACC
GTTTCACTTC CGTCGGACGA CATGAAGGGA AGAATCATAG GCAGAGAAGG AAGAAACATC
AGAACGTTTG AAAAGATCAC GGGTGTGGAC TTGATAATAG ACGACACCCC GGAAGTGGTG
GTTCTTTCCT GCTTTAACCC TCTGAGAAGA GAGATAGCCC GTATCACACT GGAAAAACTG
GTGGCGGATG GAAGAATACA TCCTGCAAGG ATCGAAGAAA TGTACGAAAA AGCAAAGCAG
GAAGTGGAGA AAGCCATAAA AGAAGCTGGA CAAGAGGCGA CGTTCAAAGC GGGTGTTATG
GGGCTCCATC CGGAGCTTGT GAAACTCCTT GGAAAATTGA AATACAGAAC GAGCTATGGC
CAAAACGTTC TGAACCATTC GATAGAAGTG GCTCTGCTCG CCGGTTACAT GGCTTCTGAA
CTTGGGCTAA ACGCGGACAA AGCGAGAAGA GGAGGACTCC TCCACGATAT AGGAAAAGCT
GTTGATCAGG AGCTGGAAGG TTCACACACG ACCATAGGTG CAGAGCTTGC CCGAAGATAC
GGAGAAAAAG AAGACATCAT AAACATGATT CTCAGCCATC ACGGAGAAGA AGAACCCATG
ACACCTGAAG CAGTACTCGT GGCAGCTGCC GATGCTCTCT CTGCGGCAAG ACCAGGTGCG
AGAAGAGAGA GTCTGGAAAA CTACATCAAA CGTCTTATGA AGCTGGAGGA GATAGCAAAA
AGCTTCAAGT ACGTTGAAAA GGCCTACGCC ATTCAGGCTG GAAGAGAAAT TAGAGTGATT
GTGGAACCCG ACAAAGTGGA TGACGCTCTC GCTGAAAAAC TCGCCTACGA TATTTCTAAA
AAGATAGAAG AAGAGCTCGA GTATCCTGGT GTTCTGAAAG TTGTGGTGAT AAGAGAGAAA
AGAAGCGTCG CTTACGCAAA GTAA
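
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCTGTGGT ACATAGTAGC AGGAGCCGGT GGTCTTCTGA TTGGATACTT AATAGCAAGT"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence text through `clean_sequence` and `gc_fraction` would give the whole-gene value to compare against the reported GC content.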
 
Protein sequence
MLWYIVAGAG GLLIGYLIAS YQINQKLRKA KEDAQTIIEK AEKEANEIKK KAIIEGREEV 
HRLREEFEKE RSRREEELRA FEERILKREE LLTRKEENLE KREHQIEELK ANLEEKMREV
EEKEKRIDEE LKRLAGMTVE EARELILEEA RQRYEHDLAK LYKEMKEQVE EEVEKEAKKV
IAFAVQRYAP DYVGEITVST VSLPSDDMKG RIIGREGRNI RTFEKITGVD LIIDDTPEVV
VLSCFNPLRR EIARITLEKL VADGRIHPAR IEEMYEKAKQ EVEKAIKEAG QEATFKAGVM
GLHPELVKLL GKLKYRTSYG QNVLNHSIEV ALLAGYMASE LGLNADKARR GGLLHDIGKA
VDQELEGSHT TIGAELARRY GEKEDIINMI LSHHGEEEPM TPEAVLVAAA DALSAARPGA
RRESLENYIK RLMKLEEIAK SFKYVEKAYA IQAGREIRVI VEPDKVDDAL AEKLAYDISK
KIEEELEYPG VLKVVVIREK RSVAYAK
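
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), and since this gene starts with ATG no special start-codon handling is needed. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate block-formatted DNA in frame 1; '*' marks a stop codon.

    Codons containing unexpected characters become 'X'; a trailing
    partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First three blocks and the final row of the gene sequence in this record.
print(translate("ATGCTGTGGT ACATAGTAGC AGGAGCCGGT"))  # prints: MLWYIVAGAG
print(translate("AGAAGCGTCG CTTACGCAAA GTAA"))        # prints: RSVAYAK*
```

The two outputs match the first ten and last seven residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.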