Gene EcSMS35_2927 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2927 
SymbolgudP 
ID6143995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3003999 
End bp3005351 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content52% 
IMG OID641617796 
Productglucarate permease 
Protein accessionYP_001744951 
Protein GI170684213 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00893] d-galactonate transporter 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCTT TAAGTCAGGC TGCGAGCAGT GTGGAAAAAC GCACGAATGC TCGTTACTGG 
ATAGTGGTGA TGTTGTTTAT CGTCACATCC TTCAACTACG GCGACCGCGC TACGCTCTCT
ATCGCCGGTT CGGAAATGGC CAAAGATATC GGCCTTGATC CTGTGGGAAT GGGCTATGTG
TTCTCTGCTT TCTCATGGGC TTATGTTATC GGGCAGATCC CTGGTGGTTG GTTGCTGGAT
CGTTTTGGTT CAAAACGCGT CTACTTCTGG TCGATCTTTA TCTGGTCGAT GTTTACCTTG
CTGCAAGGCT TCGTCGATAT CTTTAGTGGA TTCGGCATTA TCGTTGCCCT GTTTACGCTG
CGCTTCCTGG TCGGGCTTGC TGAAGCGCCC TCTTTCCCCG GCAACAGTCG CATTGTTGCG
GCCTGGTTTC CGGCGCAGGA AAGGGGAACG GCGGTGTCGA TTTTTAACTC CGCTCAATAC
TTCGCAACGG TGATCTTCGC GCCGATTATG GGCTGGCTGA CGCATGAAGT GGGCTGGTCA
CACGTCTTTT TCTTTATGGG CGGTCTGGGC ATTGTCATCA GCTTTATCTG GTTGAAAGTC
ATCCACGAGC CAAATCAACA TCCGGGTGTA AATCAGAAAG AGCTGGAGTA CATCGCCGCG
GGTGGCGCGC TGATCAATAT GGATCAGCAA AACACCAAAG TTAAAGTGCC GTTCAGCGTG
AAGTGGGGGC AGATCAAACA GCTGCTCGGG TCACGGATGA TGATCGGCGT TTATATCGGT
CAGTACTGTA TCAACGCCCT GACTTACTTC TTTATTACCT GGTTCCCGGT TTATCTGGTG
CAGGCGCGTG GGATGTCGAT TCTGAAAGCG GGCTTTGTGG CTTCCGTTCC GGCGGTTTGC
GGTTTTATCG GCGGTGTGCT GGGTGGGATT ATTTCCGACT GGCTGATGCG CCGCACGGGA
TCGCTGAACA TTGCGCGTAA AACACCGATC GTAATGGGCA TGTTGCTGTC GATGGTGATG
GTGTTCTGCA ACTACGTCAA CGTTGAGTGG ATGATCATCG GCTTTATGGC GCTGGCCTTC
TTCGGTAAGG GCATCGGGGC GCTGGGTTGG GCAGTAATGG CAGATACCGC GCCAAAAGAG
ATCAGCGGTC TTTCCGGTGG CCTGTTCAAC ATGTTCGGTA ACATTTCTGG CATCGTCACG
CCAATCGCAA TTGGTTATAT CGTTGGCACG ACTGGCTCCT TTAATGGGGC GCTGATTTAT
GTTGGTGTTC ATGCCTTAAT CGCGGTACTG AGCTACCTGG TGTTGGTGGG CGATATCAAG
CGTATCGAGT TGAAACCTGT TGCGGGGCAA TAA
 
Protein sequence
MSSLSQAASS VEKRTNARYW IVVMLFIVTS FNYGDRATLS IAGSEMAKDI GLDPVGMGYV 
FSAFSWAYVI GQIPGGWLLD RFGSKRVYFW SIFIWSMFTL LQGFVDIFSG FGIIVALFTL
RFLVGLAEAP SFPGNSRIVA AWFPAQERGT AVSIFNSAQY FATVIFAPIM GWLTHEVGWS
HVFFFMGGLG IVISFIWLKV IHEPNQHPGV NQKELEYIAA GGALINMDQQ NTKVKVPFSV
KWGQIKQLLG SRMMIGVYIG QYCINALTYF FITWFPVYLV QARGMSILKA GFVASVPAVC
GFIGGVLGGI ISDWLMRRTG SLNIARKTPI VMGMLLSMVM VFCNYVNVEW MIIGFMALAF
FGKGIGALGW AVMADTAPKE ISGLSGGLFN MFGNISGIVT PIAIGYIVGT TGSFNGALIY
VGVHALIAVL SYLVLVGDIK RIELKPVAGQ