Gene EcHS_A4587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4587 
Symbol 
ID5594711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4596673 
End bp4598034 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content52% 
IMG OID640923681 
Productmajor facilitator transporter 
Protein accessionYP_001461121 
Protein GI157163803 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAAAAG AAAATATCAC CCTCGATCCG CGTTCTTCAT TTACTCCATC TTCGTCGGCA 
GATATTCCCG TGCCACCAGA TGGATTAGTT CAACGCAGTA CCCGAATTAA ACGCATTCAA
ACCACCGCCA TGTTGTTATT ATTTTTTGCG GCGGTAATCA ATTATCTCGA CCGCAGTTCG
CTGTCGGTAG CAAATTTAAC GATTCGTGAA GAATTGGGAT TAAGTGCCAC CGAAATCGGC
GCTTTGCTCT CCGTGTTTTC ACTCGCTTAC GGGATTGCGC AACTTCCTTG CGGCCCACTA
TTGGATCGTA AAGGCCCACG CCTGATGCTG GGACTGGGGA TGTTCTTCTG GTCACTGTTC
CAGGCAATGT CTGGCATGGT GCACAACTTT ACGCAGTTCG TGTTGGTGCG TATCGGTATG
GGGATTGGTG AAGCGCCGAT GAACCCATGC GGTGTAAAAG TCATTAACGA CTGGTTCAAC
ATCAAAGAGC GCGGACGCCC GATGGGCTTC TTCAACGCAG CTTCTACCAT TGGCGTTGCC
GTAAGCCCAC CGATTCTGGC GGCGATGATG CTGGTGATGG GCTGGCGCGG GATGTTTATT
ACCATTGGTG TACTGGGGAT TTTTCTCGCC ATCGGCTGGT ATATGCTCTA TCGCAACCGC
GAGCACGTAG AACTGACTGC CGTTGAACAA GCTTATCTCA ATGCAGGTAG CGTCAATGCC
CGCCGAGATC CGCTCAGTTT TGCCGAATGG CGCAGCCTGT TCCGTAACCG TACAATGTGG
GGAATGATGC TCGGATTCAG TGGCATCAAC TACACTGCGT GGCTGTATCT GGCCTGGCTT
CCTGGTTACC TGCAAACAGC CTATAACCTG GATTTAAAAA GCACAGGGTT GATGGCGGCT
ATCCCTTTCC TGTTTGGGGC TGCCGGGATG CTGGTCAACG GTTACGTTAC TGACTGGCTG
GTCAAAGGGG GAATGGCTCC GATTAAAAGC CGTAAGATCT GCATTATTGC CGGGATGTTC
TGTTCTGCCG CCTTTACGCT GATAGTACCA CAAGCGACAA CATCCATGAC GGCGGTTCTG
CTGATTGGCA TGGCACTGTT CTGTATTCAC TTTGCCGGAA CATCCTGCTG GGGCTTGATC
CACGTCGCAG TTGCTTCTCG CATGACTGCG TCGGTGGGCA GTATCCAGAA CTTTGCCAGC
TTCATCTGCG CCTCTTTTGC GCCGATCATT ACTGGTTTTA TTGTTGATAC CACCCACTCA
TTCCGTCTGG CACTAATCAT CTGCGGTTGC GTCACCGCAG CGGGGGCACT GGCGTACATC
TTCCTGGTTC GTCAGCCGAT CAACGACCCA CGTAAAGATT AA
 
Protein sequence
MEKENITLDP RSSFTPSSSA DIPVPPDGLV QRSTRIKRIQ TTAMLLLFFA AVINYLDRSS 
LSVANLTIRE ELGLSATEIG ALLSVFSLAY GIAQLPCGPL LDRKGPRLML GLGMFFWSLF
QAMSGMVHNF TQFVLVRIGM GIGEAPMNPC GVKVINDWFN IKERGRPMGF FNAASTIGVA
VSPPILAAMM LVMGWRGMFI TIGVLGIFLA IGWYMLYRNR EHVELTAVEQ AYLNAGSVNA
RRDPLSFAEW RSLFRNRTMW GMMLGFSGIN YTAWLYLAWL PGYLQTAYNL DLKSTGLMAA
IPFLFGAAGM LVNGYVTDWL VKGGMAPIKS RKICIIAGMF CSAAFTLIVP QATTSMTAVL
LIGMALFCIH FAGTSCWGLI HVAVASRMTA SVGSIQNFAS FICASFAPII TGFIVDTTHS
FRLALIICGC VTAAGALAYI FLVRQPINDP RKD