Gene Dret_1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1010 
Symbol 
ID8418833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1185938 
End bp1188115 
Gene Length2178 bp 
Protein Length725 aa 
Translation table11 
GC content59% 
IMG OID645037580 
Productpyruvate phosphate dikinase PEP/pyruvate- binding 
Protein accessionYP_003197876 
Protein GI258405134 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0583662 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCAGT CGATCCCGGG CGCGTCCTTC GAGACCGACC CAGTTGTCCT GAGCACAGAA 
CTGCCACCGC ACCGCCTTGC CGAAGCCGGA GGCAAGGCCC GCGGTCTGCT GACCCTTTTG
CACAACTCCA TAGCCATCCC GCGAACCGCC TATCTCCCGG TTGGTGCCTA CCGCGCTTTT
GTGCAGGCGA GCGGACTGGA GCAGATGCTT TTTCGCCTCA CCCACGGACT GGACCTGGAG
ACCCGGCGCT GGGAAGAGAT CTGGGACGCT TCCCTGCGAA TTCGCAACGC CTTCGAACGC
ACCGCATTTC CACCTGAGCT TAAGCAGCGC CTGGGGCGGA CGATTGAGCA CATCTTTGCG
GACAAGCCGC TCGCCATCAG GTCCTGTGCA CCGGCCGAAG ACGGAGCCCA TTCGCATGCC
GGGTTGCATG AATCGTATAT CAATATCCGG GGCGAAACGG AGGTATTGGT GGCGTTACGG
CGGGTCTGGG CATCGTTATG GTCTGACCGG GCGCTCATGT ATCGCCGCGA ATTGGGGTTG
GATCCCGCCC ATGCCGGAAT GGGAGTCCTG ATGCAGGAAA TCGTTCCCAG CGACAAGTCC
GGAATCATTT TCACGCAAGG TCCACTTGAC GAAACCCAGA CCATTGTCGA GGCCGCTCCG
GGATTGCCGG GGGGAATCGT TGACAACGCC GTGGACACAG AGCGTTTCGT GGTCGACCGG
CGCACCGGAC GTGTGGAGGA ACACACCGCC TCGCAAGCGA CGCATAAACT TGTCCTCGGG
GAACAGGGAA CGGTTTTGGT CCCGCTTTCT GCTCCCGTCA AGGATATTTT GAATTCTGAA
GAACTCCAGC AGGTGCTGGA AATGGGTCGA CGGGCCGAGG GACTGTTCGG AACAGCCCAG
GACATGGAGT GGGCCTTCGC CAAAAAACGT CTTTTTGCCC TGCAGTCCCG CCCGATAACC
AGTCACGTTG AACCGAAAAC AGATGCCCTG TGGCAGGCCA CGGACAAGCG GCCATGGTAT
CTGAGTTTGA CCCGCAGTTT TGCAAACTTG CAAGCCCTGC GCGAGCGCAT CGAACACACC
ATCCTGCCAG AGATGAGTGC CGAAAGCGAA ACACTGCTCC AACGCGACCT GCCCGCACTA
TCCTGCGACG AACTGGACCG GGAAGTTCGC CACCGAAGGG CACGCCTTGA TCATTGGCAA
ACGGTCTACT GGCAGGACCT CATTCCCTTT GCCCACGGTA TCCGTTTGTT CGGCGTCCTC
TACAACGACA CGGTCCACCC GGAAGATCCG TTTGAATTCA CTCAACTCCT GGCAGGAGAA
GATCTCCTCG CCCTGGAGCG CAATGCTCTT TTAGACCATC TGGCTGAGAT GGTGGCCGAA
GACGATGACC TGCGCCACGA CCTGGAGGAT CATATCCTCC CGTCCACCGG TCCATTCGCC
CAGCAGCTTG AAACATTTAT TTCCCGCTTC GGAGACTTGG CCTGCAGCAC GACCTGGTGC
CAGGAGGGCC CCTGGGGAGT ACTGCGGATA GTCCTCGATC TGGCCTCCTC ACCCCACAGG
CACGCAACAA AGCGACACCA GCAAAGCGCC AACCTAGAAG AACAATTCTT CCAGGCCATT
TCAAGGGAGC GCCGGAAGTG GGCCGCTGAA GTGCTCGACC TGGCGCGCGC GAGTTATCGA
TTCCGGGACG ACGACAATCT CTATCTCGGC AAGATTCAGG CTCGGTATTA CGAAGTCCTG
GATGAAGCAA GAAAACGCCG CGATTTTGAG CTCTGTGCCT GGCCGCAGGA CATTGAAGAC
CTGCTTCAGG ACCCCAGTGC CCATCGCGCT GGCCCGGCCA CAGTGGCACG TTCCACTACG
GAAACACTTT CAGGGACCCC AGCCTCGGCC GGCATTGCCA GAGGGCCGGT ACGGGTCATT
CACTCCCCGG AGGATTTATA TAGTGTACAA TCCGGGGATG TGCTCGTCTG TGACGCCCTT
GATCCGAACA TGACCTTTAT CGTACCGCTG GTGGCCGCAA TTATCGAGCG CCGCGGCGGC
ATGCTCGTGC ACAGTGCGAT TATCGCCCGG GAATACGGCA TCCCCTGCAT TACCGGCGTC
GTTGCCGCTA CCTCCCGTAT TCCGGATGGG ACGGCGGTTG CTGTGGACGG GTTCAAGGGC
ACTGTGACCT TCCTCTGA
 
Protein sequence
MSQSIPGASF ETDPVVLSTE LPPHRLAEAG GKARGLLTLL HNSIAIPRTA YLPVGAYRAF 
VQASGLEQML FRLTHGLDLE TRRWEEIWDA SLRIRNAFER TAFPPELKQR LGRTIEHIFA
DKPLAIRSCA PAEDGAHSHA GLHESYINIR GETEVLVALR RVWASLWSDR ALMYRRELGL
DPAHAGMGVL MQEIVPSDKS GIIFTQGPLD ETQTIVEAAP GLPGGIVDNA VDTERFVVDR
RTGRVEEHTA SQATHKLVLG EQGTVLVPLS APVKDILNSE ELQQVLEMGR RAEGLFGTAQ
DMEWAFAKKR LFALQSRPIT SHVEPKTDAL WQATDKRPWY LSLTRSFANL QALRERIEHT
ILPEMSAESE TLLQRDLPAL SCDELDREVR HRRARLDHWQ TVYWQDLIPF AHGIRLFGVL
YNDTVHPEDP FEFTQLLAGE DLLALERNAL LDHLAEMVAE DDDLRHDLED HILPSTGPFA
QQLETFISRF GDLACSTTWC QEGPWGVLRI VLDLASSPHR HATKRHQQSA NLEEQFFQAI
SRERRKWAAE VLDLARASYR FRDDDNLYLG KIQARYYEVL DEARKRRDFE LCAWPQDIED
LLQDPSAHRA GPATVARSTT ETLSGTPASA GIARGPVRVI HSPEDLYSVQ SGDVLVCDAL
DPNMTFIVPL VAAIIERRGG MLVHSAIIAR EYGIPCITGV VAATSRIPDG TAVAVDGFKG
TVTFL