Gene EcolC_3699 details

Gene Information

Locus tag: EcolC_3699
Symbol:
ID: 6068690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4048299
End bp: 4049660
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 52%
IMG OID: 641603117
Product: major facilitator transporter
Protein accession: YP_001726637
Protein GI: 170021683
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
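
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 4048299
end_bp = 4049660
reported_gene_length = 1362      # bp
reported_protein_length = 453    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length (bp):", gene_length)
print("codons:", codons, "leftover bases:", remainder)
print("protein length (aa):", protein_length)
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.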


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0619657
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAAAAG AAAATATCAC CCTCGATCCG CGTTCTTCAT TTACTCCATC TTCGTCGGCA 
GATATTCCCG TGCCACCAGA TGGATTAGTT CAACGCAGTA CCCGAATTAA ACGCATTCAA
ACCACCGCCA TGTTGTTATT ATTTTTTGCG GCGGTAATCA ATTATCTCGA CCGCAGTTCG
CTGTCGGTAG CAAATTTAAC GATTCGTGAA GAATTGGGAT TAAGTGCCAC CGAAATCGGC
GCTTTGCTCT CCGTGTTTTC ACTCGCTTAC GGCATTGCGC AACTTCCTTG CGGCCCACTA
TTGGATCGTA AAGGCCCGCG CCTGATGTTG GGACTGGGGA TGTTCTTCTG GTCACTGTTC
CAGGCAATGT CTGGCATGGT GCACAGCTTT ACGCAGTTCG TGTTGGTGCG TATCGGTATG
GGAATTGGTG AAGCGCCGAT GAACCCATGC GGTGTAAAAG TCATTAACGA CTGGTTCAAC
ATCAAAGAGC GCGGACGCCC GATGGGCTTC TTCAACGCAG CTTCTACCAT TGGCGTTGCC
GTAAGCCCAC CGATTCTGGC GGCGATGATG CTGGTGATGG GCTGGCGCGG GATGTTTATT
ACCATTGGTG TACTGGGGAT TTTTCTCGCC ATCGGCTGGT ATATGCTCTA TCGCAACCGC
GAGCACGTAG AACTGACTGC CGTTGAACAA GCTTATCTCA ATGCAGGTAG CGTCAATGCC
CGCCGAGATC CGCTCAGTTT TGCCGAATGG CGCAGCTTGT TCCGTAACCG CACAATGTGG
GGAATGATGC TCGGATTCAG TGGCATCAAC TACACTGCGT GGCTGTATCT GGCCTGGCTT
CCTGGTTACC TGCAAACAGC CTATAACCTG GATTTAAAAA GCACAGGGTT GATGGCGGCT
ATCCCTTTCC TGTTTGGGGC TGCCGGGATG CTGGTCAACG GTTACGTTAC TGACTGGCTG
GTCAAAGGGG GAATGGCTCC GATTAAAAGC CGTAAGATCT GCATTATTGC CGGGATGTTC
TGTTCTGCCG CCTTTACGCT GGTCGTACCG CAAGCGACAA CATCCATGAC AGCGGTTCTG
CTGATTGGTA TGGCACTGTT CTGTATTCAC TTTGCCGGAA CATCCTGCTG GGGCTTGATC
CACGTCGCAG TTGCTTCTCG CATGACTGCG TCGGTGGGCA GTATCCAGAA CTTTGCCAGC
TTCATCTGCG CCTCTTTTGC GCCGATCATT ACTGGTTTTA TTGTTGATAC CACCCACTCA
TTCCGTCTGG CACTAATCAT CTGCGGTTGC GTCACCGCAG CGGGGGCACT GGCGTACATC
TTCCTGGTTC GTCAGCCGAT CAACGACCCA CGGAAAGATT AA
 
Protein sequence
MEKENITLDP RSSFTPSSSA DIPVPPDGLV QRSTRIKRIQ TTAMLLLFFA AVINYLDRSS 
LSVANLTIRE ELGLSATEIG ALLSVFSLAY GIAQLPCGPL LDRKGPRLML GLGMFFWSLF
QAMSGMVHSF TQFVLVRIGM GIGEAPMNPC GVKVINDWFN IKERGRPMGF FNAASTIGVA
VSPPILAAMM LVMGWRGMFI TIGVLGIFLA IGWYMLYRNR EHVELTAVEQ AYLNAGSVNA
RRDPLSFAEW RSLFRNRTMW GMMLGFSGIN YTAWLYLAWL PGYLQTAYNL DLKSTGLMAA
IPFLFGAAGM LVNGYVTDWL VKGGMAPIKS RKICIIAGMF CSAAFTLVVP QATTSMTAVL
LIGMALFCIH FAGTSCWGLI HVAVASRMTA SVGSIQNFAS FICASFAPII TGFIVDTTHS
FRLALIICGC VTAAGALAYI FLVRQPINDP RKD
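
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any calculation. The sketch below is a minimal, hedged example of stripping that layout and computing a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample input is only the first line of the gene sequence, so its result is not the record's whole-gene GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "GTGGAAAAAG AAAATATCAC CCTCGATCCG CGTTCTTCAT TTACTCCATC TTCGTCGGCA "

cleaned = clean_sequence(first_line)
print("bases in sample:", len(cleaned))
print("GC percent of sample: %.1f" % gc_percent(cleaned))
```

To check the reported GC content, the same two helpers can be applied to the full gene sequence pasted in as one multi-line string.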