Gene Dret_2015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2015 
Symbol 
ID8419860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2315311 
End bp2317110 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content56% 
IMG OID645038603 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_003198877 
Protein GI258406135 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.475884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGAC ATATTTTGTC TGGAATCCCT GTCTCCACCG GCATTACTAT CGGCCGGGTC 
TATTATCTCA ACCGGGGACG GTTTTGTCTC AGTGCCAGAC AGACCGTGGC CGAAGGGCTT
GTGGACCAGG AAATCCATCG GCTCAAGTAC GCCTTTGACC AAGCCCTGGT CGAGCTTGAG
GAGATACGGG CCAAGGTCCC TGAGGAACTC CGGGAGCACG CCTCGATCAT CGATTCGCAT
TTGATGATCC TGCGCGATCC GAAACTGCGC CAGTCGGCCC AGAACTATGT CCGCGAAATG
CGTTTGAACG CGGAATGGGC TCTGGAAAAG GCGGTTGACG ATATCGAGAA GGTCTTTGCC
AGCATCGAGG ATGAATATAT CCGGGACCGG ATTCTGGATG TTCGCTCTGT GGCCGAGCGG
GTCTATAAAC AAATGCTCGG TTGCGAGGAG GGCCCGAAGG CGATCACCAG TCGGGTCATT
CTGGTGGCCC ATGACCTGAC TCCCGCTGAT ACCATCGAGA TGGAGGTGGA CAAGATCATG
GGGTTTGCCA CCGCCCTGGG CGGGAAGACC TCGCATGTAG GCATTCTGGC CCGGTCGCTG
CAGATTCCAG CCGTGGTGGG GGTGACCGAC CTTGAAGATT CGATTCAGGA TGACGACGAG
GTCATCATCG ACGGTCTGCA GGGCAAAGTG TTCATTGCTC CCGACAATGA GGAACTGGCT
CAGTATACCG AACTCAAGGA TCAGTTCGAG GCCTACCAAA GCACGATCAT GCGCAGCTGC
CACTTGCCCG GGGAGACCAT CGACGGGTAC CGGGTCAACG TGCTGGCGAA TATCGAATTG
TTCGAAGAGG TGACCTCGGT CCTTGATCAC GGCGGGGAGG GCATTGGCCT GTACCGGACT
GAATACAGTT ACCTCAATCG CAATGAACTG CCTTCGGAAA ATGAACTCTT TGAGGAGTAT
TGGGACCTGG CCTCCATCGT CGCCCCGGAG CGTCTGGTCA TCCGCACCCT CGATCTGGGC
GGCGACAAAT TGAGCGATCT TTTCGGGCAC CTGGAGGAGG CCAATCCGGC CCTCGGATTG
CGGGCGATCC GGTTTTGCCG CCAGTACCCC TATTTGTTCC GCACGCAATT GCGGGCCATT
CTCCGGGCCA GTGTGACCGG GAATGTTTCG ATCATGTTTC CCATGGTTTC CGGACTCAAT
GAGCTGGTCG AGCTGAAGCG GTTTGTCGAG GGCGTCAAAC AGGAATTGCG CCGCGAAAAT
ATCGCCTACA ATCCGGATAC CCCCATCGGA ATCATGGTCG AGCTGCCGTC CGCAGTGATG
ACCGCGGATA TCCTGGCCAA AGAAGTCGAT TTTTTCAGTA TCGGGACCAA CGATCTCATC
CAATACTCAC TGGGGATCGA CCGGACCAAT AAATACGTCT CCTATCTGTA TCAGCCGCTG
CACCCGGCCC TGTTGCGCAG CATCAAGTCT GTTGTCGACG CGGGACACCA GGCCGGTATC
GAAGTCAGCC TCTGCGGCGA AATGGCCTCG GACCCGTTTT GCGTGCCCAT TTTGATGGGC
ATGCAAGTGG ACAATTTGAG CATCAATCCG CAATCGATCC CGGGAATCAA GCGGATCATC
CGCAACGCGA CCATGGAAGA ATGCGCCTAT CTGCTCAAAC AGGTCATCAA TAGTTCGTCG
GTGGCCAAAA ATAACGCCCT GGTCCAGGAC ATTATCTTCA AGCGTTTTCC GGAAGAAATC
ATGTTCTATT CCTCCATGCT CGGCAACGAT GAGGAGCGCC CGGGCGGATT ATTCGGGTAA
 
Protein sequence
MARHILSGIP VSTGITIGRV YYLNRGRFCL SARQTVAEGL VDQEIHRLKY AFDQALVELE 
EIRAKVPEEL REHASIIDSH LMILRDPKLR QSAQNYVREM RLNAEWALEK AVDDIEKVFA
SIEDEYIRDR ILDVRSVAER VYKQMLGCEE GPKAITSRVI LVAHDLTPAD TIEMEVDKIM
GFATALGGKT SHVGILARSL QIPAVVGVTD LEDSIQDDDE VIIDGLQGKV FIAPDNEELA
QYTELKDQFE AYQSTIMRSC HLPGETIDGY RVNVLANIEL FEEVTSVLDH GGEGIGLYRT
EYSYLNRNEL PSENELFEEY WDLASIVAPE RLVIRTLDLG GDKLSDLFGH LEEANPALGL
RAIRFCRQYP YLFRTQLRAI LRASVTGNVS IMFPMVSGLN ELVELKRFVE GVKQELRREN
IAYNPDTPIG IMVELPSAVM TADILAKEVD FFSIGTNDLI QYSLGIDRTN KYVSYLYQPL
HPALLRSIKS VVDAGHQAGI EVSLCGEMAS DPFCVPILMG MQVDNLSINP QSIPGIKRII
RNATMEECAY LLKQVINSSS VAKNNALVQD IIFKRFPEEI MFYSSMLGND EERPGGLFG