Gene EcSMS35_4932 details


Gene Information

Locus tag: EcSMS35_4932
Symbol: deoB
ID: 6147067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 5047533
End bp: 5048756
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 54%
IMG OID: 641619735
Product: phosphopentomutase
Protein accession: YP_001746839
Protein GI: 170682473
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1015] Phosphopentomutase
TIGRFAM ID: [TIGR01696] phosphopentomutase
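
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
START_BP = 5047533
END_BP = 5048756

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1224 407"
```

Under these assumptions the computed values agree with the Gene Length (1224 bp) and Protein Length (407 aa) fields.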


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.912083
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGTG CATTTATTAT GGTGCTGGAC TCATTCGGCA TCGGCGCTAC AGAAGATGCA 
GAACGCTTTG GTGACGTCGG GGCTGACACC CTGGGTCATA TCGCAGAAGC TTGTGCCAAA
GGCGAAGCTG ATAACGGTCG TAAAGGCCCG CTCAATCTGC CAAATCTGAC CCGTCTGGGG
CTGGCGAAAG CACACGAAGG TTCTACCGGT TTCATTCCGG CGGGAATGGA CGGCAACGCT
GAAGTTATCG GCGCGTACGC ATGGGCGCAC GAAATGTCAT CCGGTAAAGA TACCCCGTCT
GGTCACTGGG AAATCGCCGG TGTCCCGGTT CTGTTTGAGT GGGGATACTT CTCCGATCAC
GAAAACAGCT TCCCGCAAGA GCTACTGGAT AAACTGGTCG AACGCGCTAA TCTGCCGGGT
TACCTCGGTA ACTGCCACTC TTCCGGTACG GTCATTCTGG ATCAGCTGGG CGAAGAGCAC
ATGAAAACCG GCAAGCCGAT TTTCTATACC TCCGCTGACT CCGTGTTCCA GATTGCCTGC
CATGAAGAAA CTTTCGGTCT GGACAAACTC TACGAACTGT GCGAAATCGC TCGTGAAGAG
CTGACCAACG GCGGCTACAA CATCGGTCGT GTTATCGCTC GTCCGTTTAT CGGCGACAAA
GCCGGTAACT TCCAGCGTAC CGGTAACCGT CACGATCTGG CTGTTGAACC GCCAGCACCG
ACCGTGCTGC AGAAACTGGT TGATGAAAAA CACGGCCAGG TGGTTTCTGT CGGTAAAATT
GCGGACATCT ACGCTAACTG CGGCATCACC AAGAAAGTGA AAGCGACTGG CCTGGACGCG
CTGTTTGACG CCACCATCAA AGAGATGAAA GAAGCGGGTG ATAACACTAT CGTCTTCACC
AACTTCGTTG ACTTCGACTC TTCCTGGGGC CACCGTCGCG ACGTTGCCGG TTATGCTGCG
GGTCTGGAGC TGTTCGACCG TCGTCTGCCG GAGCTGATGT CTCTGCTGCG CGATGACGAC
ATCCTGATCC TCACCGCTGA CCACGGTTGT GATCCGACCT GGACCGGTAC TGACCACACG
CGTGAACACA TTCCGGTACT GGTATACGGC CCGAAAGTAA AACCGGGCTC ACTGGGTCAC
CGTGAAACCT TCGCGGATAT CGGCCAGACG CTGGCAAAAT ATTTTGGTAC TTCTGATATG
GAATATGGCA AAGCCATGTT CTGA
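
The GC content field of this record can be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length; `gc_percent` is a hypothetical helper, and the database may round or compute its value differently.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G/C bases; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, as a small worked example.
print(gc_percent("ATGAAACGTG"))  # prints "40.0"
```

Passing the whole gene sequence, with its spacing left in place, gives a figure to compare against the GC content field.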
 
Protein sequence
MKRAFIMVLD SFGIGATEDA ERFGDVGADT LGHIAEACAK GEADNGRKGP LNLPNLTRLG 
LAKAHEGSTG FIPAGMDGNA EVIGAYAWAH EMSSGKDTPS GHWEIAGVPV LFEWGYFSDH
ENSFPQELLD KLVERANLPG YLGNCHSSGT VILDQLGEEH MKTGKPIFYT SADSVFQIAC
HEETFGLDKL YELCEIAREE LTNGGYNIGR VIARPFIGDK AGNFQRTGNR HDLAVEPPAP
TVLQKLVDEK HGQVVSVGKI ADIYANCGIT KKVKATGLDA LFDATIKEMK EAGDNTIVFT
NFVDFDSSWG HRRDVAGYAA GLELFDRRLP ELMSLLRDDD ILILTADHGC DPTWTGTDHT
REHIPVLVYG PKVKPGSLGH RETFADIGQT LAKYFGTSDM EYGKAMF
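
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The sketch below assumes the gene is given 5' to 3' on the coding strand and uses the amino acid assignments of NCBI translation table 11, which match the standard code (the two tables differ only in permitted start codons). `translate` is a hypothetical helper, not a database function, and it raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAACGTG CATTTATTAT GGTGCTGGAC TCATTCGGCA TCGGCGCTAC AGAAGATGCA"
print(translate(first_line))  # prints "MKRAFIMVLDSFGIGATEDA"
```

The output matches the first 20 residues of the protein sequence listed above; translating the full gene sequence the same way allows a residue-by-residue comparison.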