Gene Smed_0298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0298 
Symbol 
ID5321130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp317540 
End bp319360 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content64% 
IMG OID640789233 
Productphosphogluconate dehydratase 
Protein accessionYP_001325992 
Protein GI150395525 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCG ATTCCCGCAT AGCCGCCATT ACCGCACGCA TCGTCGAACG CTCAAAGCCC 
TACCGGGAGC CCTATATTGA CCGTGTACGC GGGGCCGCAG CGAACGGGCC GCACCGCACG
GTGCTCGGCT GCGGCAACCT CGCGCATGGA TTTGCGGTCT GTTCCCCCGC CGAGAAGGTG
GCGCTTGCGG GCGACCGCGT GGCAAATCTC GGCATCATCA CCTCCTACAA CGACATGCTT
TCGGCGCATC AGCCGTTCGA AACCTATCCG GCGCTGATCC GTGAGGCCGC GCACGAGGCC
GGCGGTGTCG CGCAGGTCGC CGGCGGTGTG CCGGCCATGT GCGACGGCGT CACCCAGGGG
CAGCCCGGCA TGGAGCTGTC GCTCTTCTCG CGCGATGTCA TCGCCATGGC GGCCGGCATC
GGCCTGTCGC ACAACATGTT CGACGCGGCC GTCTATCTCG GCGTCTGCGA CAAGATCGTA
CCCGGGCTTG CGATCGCCGC GCTCACCTTC GGCCATCTGC CGGCGGTCTT CATTCCCGCC
GGGCCGATGA CCTCAGGCCT GCCGAACGAC GAGAAGGCCA AGATCCGGCA GCTCTTCGCC
GAGGGCAAGG TCGGCCGCGA CGAATTGCTG GAGGCGGAGT CCAAATCCTA TCACGGCCCC
GGCACCTGTA CCTTCTACGG CACCGCCAAT TCCAATCAGA TGCTGATGGA AATCATGGGC
TTCCACTTGC CCGGCGCTTC CTTCATCAAT CCCGGCACGC CGCTTCGCGA CGCGCTGACC
AAGGAAGCCA CGAAGCGGGC GCTTGCGATC ACGGCGCTGG GCAACGAATT CACCCCGGCC
GGCGAGATGA TAGACGAGCG GTCGATCGTC AACGGCGTCG TCGGCCTGCA TGCGACCGGC
GGTTCCACGA ACCACACGCT GCACCTTGTC GCAATGGCTC GCGCAGCGGG CATCGTGCTG
ACCTGGCAGG ACATTTCGGA GCTTTCCGAC CTAGTGCCGC TGCTCGCGCG CGTCTATCCG
AACGGGCTGG CTGACGTGAA CCATTTCCAC GCCGCAGGCG GGATGGGTTT CATGATCGCT
CAGCTCCTCA GCAAAGGCTT GCTGCATGAC GACGTCCGCA CCGTCTACGG CCAGGGGCTC
AGTGCCTATG CCATCGACGT GAAGCTCGGC GAAAAAGGCA GCGTCAAACG CGAGCCGGCG
CCGGCTGTGA GCGCCGATCC GAAGGTCCTC GCAACCATCG ACCGTCCGTT CCAGCACACT
GGCGGCCTCA AGATGCTGAG CGGCAATATC GGCAAGGCGG TCATCAAGAT CTCCGCCGTC
AAACCGGAAA GCCATGTCAT CGAGGCGCCC GCCAAGATCT TCAACGGCCA GGGTGAACTC
AACGCCGCCT TCAAGGCGGG TAAGCTCGAA GGCGATTTCG TGGCTGTCGT GCGCTTTCAG
GGCCCCAAGG CCAATGGCAT GCCCGAACTG CACAAGCTGA CGACGGTGCT CGGGATCCTG
CAGGACCGCG GCCAGAAAGT TGCAATACTC ACCGACGGGC GCATGTCCGG CGCGTCCGGC
AAGGTTCCGG CGGCAATCCA TGTGACCCCG GAAGCGAAGG AAGGCGGTCC GATCGCGCGC
ATCCAGGAGG GCGACATCGT CCGCATCGAT GCGATCAAAG GCAAGATCGA GGTGCTCGTC
GAGGATATCG CGCTTAAGAC GCGCGTGCCG GCGCATATCG ATCTGTCCGA CAACGAGTTC
GGTATGGGCC GCGAACTCTT CGCTCCCTTC AGGCAGATCG CCGGTGCAGC CGACCGTGGC
GGAAGCGTGC TCTTCAATTA G
 
Protein sequence
MSADSRIAAI TARIVERSKP YREPYIDRVR GAAANGPHRT VLGCGNLAHG FAVCSPAEKV 
ALAGDRVANL GIITSYNDML SAHQPFETYP ALIREAAHEA GGVAQVAGGV PAMCDGVTQG
QPGMELSLFS RDVIAMAAGI GLSHNMFDAA VYLGVCDKIV PGLAIAALTF GHLPAVFIPA
GPMTSGLPND EKAKIRQLFA EGKVGRDELL EAESKSYHGP GTCTFYGTAN SNQMLMEIMG
FHLPGASFIN PGTPLRDALT KEATKRALAI TALGNEFTPA GEMIDERSIV NGVVGLHATG
GSTNHTLHLV AMARAAGIVL TWQDISELSD LVPLLARVYP NGLADVNHFH AAGGMGFMIA
QLLSKGLLHD DVRTVYGQGL SAYAIDVKLG EKGSVKREPA PAVSADPKVL ATIDRPFQHT
GGLKMLSGNI GKAVIKISAV KPESHVIEAP AKIFNGQGEL NAAFKAGKLE GDFVAVVRFQ
GPKANGMPEL HKLTTVLGIL QDRGQKVAIL TDGRMSGASG KVPAAIHVTP EAKEGGPIAR
IQEGDIVRID AIKGKIEVLV EDIALKTRVP AHIDLSDNEF GMGRELFAPF RQIAGAADRG
GSVLFN