Gene Tpet_0572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0572 
Symbol 
ID5170437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp570383 
End bp571861 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content49% 
IMG OID640563079 
Productbifunctional shikimate kinase/3-dehydroquinate synthase 
Protein accessionYP_001244169 
Protein GI148269709 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase
[COG0703] Shikimate kinase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000527304 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATCT TCCTCGTTGG GATGATGGGT TCTGGAAAGA GTACGATCGG TAAAAGGGTT 
TCCGAGGTGC TCGACCTTCA GTTCATAGAT ATGGACGAAG AGATAGAAAG AAGAGAGGGA
AGAAGAGTTC GAAGGATTTT CGAAGAAGAC GGTGAAGAGT ACTTCCGATT GAAAGAGAAA
GAACTTCTTA GGGAACTTGT GGAAAGAGAC AACGTGGTCG TGGCAACGGG TGGAGGTGTT
GTGATCGATC CAGAGAACAG GGAGCTTTTG AAGAAAGAGA AGACTCTCTT TCTCTACGCT
CCCCCTGAAG TGTTGATGGA AAGAGTAACA ACAGAGAACA GGCCTCTTCT GAGAGAAGGA
AAAGAGAGAA TACGAGAGAT CTGGGAAAGG AGAAAACAGT TCTACACGGA GTTTAGAGGG
ATCGACACCT CCAAGTTGAA CGAGTGGGAA ACAACCGCAC TCGTTGTGCT GGAGGCTCTG
GACGAGAAAG AAATCTCAAC GATAGAAAAA CCACACCTGG TGAAGATCAT CCTCGGTGGT
TTCAAGAGGG TGAGGAACGA AGAGCTGGTT TTCACCACGG AGAGGGTGGA GAAGATATAC
GGAAGGTACC TTCCAGAGAA TCGGCTTCTT TTTCCGGATG GAGAGGAAGT GAAGACGCTG
GAGCATGTCT CCAGAGCGTA CTACGAACTT GTGAGGATGG ACTTTCCCAG GGGAAAGACC
ATAGCGGGTG TCGGAGGAGG TGCTCTCACC GACTTCACTG GCTTTGTGGC GAGCACGTTC
AAAAGGGGAG TGGGACTTTC TTTCTATCCG ACAACACTTC TGGCTCAGGT GGACGCTTCC
GTTGGTGGAA AGAATGCCAT CGATTTCGCT GGAGTGAAAA ACGTCGTTGG GACTTTCAGA
ATGCCAGACT ACGTCATCAT AGATCCCACC GTCACGCTTT CGATGGATGA GGGCAGGTTC
GAAGAGGGAG TCGTGGAAGC CTTCAAGATG ACGATTCTAT CGGGTCGCGG GGTAGAACTC
TTCGATGAGC CGGAGAAGAT TGAGAAGAGA AATCTCAGAG TTCTCAGCGA GATGGTAAAA
ATCTCCGTCG AAGATAAAGC GAGGATAGTA ATGGAAGATC CCTACGACAT GGGTTTGAGA
CACGCCCTGA ATCTGGGACA CACGCTCGGT CATGTGTACG AGATGCTGGA AGGGGTACCT
CACGGTATAG CGGTAGCGTG GGGCATCGAA AAAGAGACGA TGTACCTGTA CAGAAAGGGA
ATAGTGCCTA AGGAAACCAT GAGATGGATC GTAGAAAAGG TCAAACAGAT CGTACCAATT
CCTGTTCCAT CCGTCGATGT TGAGAAAGCC AGAAATCTCA TTCTGAACGA CAAGAAGATC
CTGAAAGGTT CCAGAGTTAG GCTTCCTTAC GTGAAAGAAA TCGGAAAGAT CGAATTCTTA
GAGGTCGATC CGCTCGAACT TTTGGAGGTG GTAGATTGA
 
Protein sequence
MRIFLVGMMG SGKSTIGKRV SEVLDLQFID MDEEIERREG RRVRRIFEED GEEYFRLKEK 
ELLRELVERD NVVVATGGGV VIDPENRELL KKEKTLFLYA PPEVLMERVT TENRPLLREG
KERIREIWER RKQFYTEFRG IDTSKLNEWE TTALVVLEAL DEKEISTIEK PHLVKIILGG
FKRVRNEELV FTTERVEKIY GRYLPENRLL FPDGEEVKTL EHVSRAYYEL VRMDFPRGKT
IAGVGGGALT DFTGFVASTF KRGVGLSFYP TTLLAQVDAS VGGKNAIDFA GVKNVVGTFR
MPDYVIIDPT VTLSMDEGRF EEGVVEAFKM TILSGRGVEL FDEPEKIEKR NLRVLSEMVK
ISVEDKARIV MEDPYDMGLR HALNLGHTLG HVYEMLEGVP HGIAVAWGIE KETMYLYRKG
IVPKETMRWI VEKVKQIVPI PVPSVDVEKA RNLILNDKKI LKGSRVRLPY VKEIGKIEFL
EVDPLELLEV VD