Gene EcolC_1863 details


Gene Information

Locus tag: EcolC_1863
Symbol:
ID: 6066677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2062711
End bp: 2064069
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 45%
IMG OID: 641601276
Product: major facilitator transporter
Protein accession: YP_001724838
Protein GI: 170019884
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
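
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names span_length and coding_codons are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the record above.
START_BP = 2062711
END_BP = 2064069
GENE_LENGTH_BP = 1359
PROTEIN_LENGTH_AA = 452


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the values listed in this record.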


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.369973
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000261137
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAACAAT ATGATCAAAT TGGCGCAAGA CTGGACCGCT TGCCTTTGGC CCGGTTTCAT 
TATCGTATAT TTGGTATAAT AAGCTTTAGT CTGTTATTAA CGGGGTTTTT GAGTTACTCA
GGTAATGTCG TCTTAGCAAA GCTGGTAAGC AATGGATGGT CAAATAATTT CCTCAATGCC
GCCTTTACCT CGGCATTAAT GTTTGGTTAT TTCATCGGCT CACTTACTGG TGGGTTTATT
GGTGACTACT TTGGGCGGCG CAGGGCGTTT CGCATAAATC TTCTCATCGT CGGAATTGCT
GCAACAGGGG CCGCTTTTGT CCCTGATATG TACTGGCTCA TTTTCTTTCG CTTCCTCATG
GGAACAGGAA TGGGGGCGCT GATTATGGTT GGCTATGCCT CATTTACGGA GTTTATCCCC
GCGACGGTGC GTGGAAAATG GTCCGCGCGG CTCTCATTTG TTGGTAACTG GTCGCCCATG
CTATCTGCGG CGATAGGCGT GGTGGTTATC GCTTTTTTTA GTTGGCGGAT AATGTTTCTG
TTGGGGGGTA TTGGCATACT GTTAGCCTGG CTTCTCTCAG GTAAATACTT TATTGAGTCG
CCACGATGGC TGGCAGGGAA AGGGCAAATC GCCGGTGCAG AAAGCCAACT TCGTGAAGTA
GAGCAGCAAA TTGAAAGAGA GAAGAGTATT CGTTTACCCC AGCTTACTTT GAACCAGAGC
AACAGCAAGG TTAAGGTAAT CAAGGGTACC TTCTGGCTCC TGTTTAAAGG GGAAATGTTA
CGACGTACAT TAGTCGCGAT TACTGTTTTA ATTGCAATGA ACATTTCGCT TTATACCATC
ACCGTATGGA TACCGACCAT ATTTGTTAAC TCCGGCATTG ATGTCGATAA ATCAATATTA
ATGACCGCTG TTATTATGAT TGGCGCTCCG GTAGGAATAT TTATTGCGGC ATTAATTATT
GATCATTTTC CTCGTCGATT ATTTGGCTCC GCCTTACTTA TTATTATTGC CGTGTTAGGC
TATATCTATT CAATTCAGAC TACAGAGTGG GCGATTTTAA TCTATGGTCT GGTGATGATC
TTCTTTTTAT ACATGTATGT TTGCTTCGCG TCGGCGGTTT ATATCCCGGA GCTTTGGCCA
ACGCATTTAC GCCTGCGCGG TTCGGGTTTC GTTAATGCCG TCGGACGGAT CGTCGCAGTT
TTCACGCCCT ATGGCGTTGC GGCATTATTA ACACATTATG GGTCGATCAC GGTGTTTATG
GTGCTTGGTG TCATGTTATT GCTCTGTGCG CTGGTTCTCT CCATTTTTGG CATCGAAACG
CGGAAGGTGT CGTTGGAAGA GATTTCTGAG GTGAATTAA
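
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before its length or GC content can be compared with the values in the Gene Information section (1359 bp, 45%). The following is a minimal sketch of that clean-up; clean_sequence and gc_percent are hypothetical helper names, and the sequence text is assumed to contain only A, C, G, T and whitespace.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a whitespace-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above, as a small demonstration.
    first_line = "ATGGAACAAT ATGATCAAAT TGGCGCAAGA CTGGACCGCT TGCCTTTGGC CCGGTTTCAT"
    print(len(clean_sequence(first_line)))
    print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to the same functions gives figures that can be set against the record's Gene Length and GC content fields.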
 
Protein sequence
MEQYDQIGAR LDRLPLARFH YRIFGIISFS LLLTGFLSYS GNVVLAKLVS NGWSNNFLNA 
AFTSALMFGY FIGSLTGGFI GDYFGRRRAF RINLLIVGIA ATGAAFVPDM YWLIFFRFLM
GTGMGALIMV GYASFTEFIP ATVRGKWSAR LSFVGNWSPM LSAAIGVVVI AFFSWRIMFL
LGGIGILLAW LLSGKYFIES PRWLAGKGQI AGAESQLREV EQQIEREKSI RLPQLTLNQS
NSKVKVIKGT FWLLFKGEML RRTLVAITVL IAMNISLYTI TVWIPTIFVN SGIDVDKSIL
MTAVIMIGAP VGIFIAALII DHFPRRLFGS ALLIIIAVLG YIYSIQTTEW AILIYGLVMI
FFLYMYVCFA SAVYIPELWP THLRLRGSGF VNAVGRIVAV FTPYGVAALL THYGSITVFM
VLGVMLLLCA LVLSIFGIET RKVSLEEISE VN
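
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which this sketch does not model; the gene here starts with ATG). The names CODON_TABLE and translate are hypothetical and introduced only for this illustration.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so the ten-base blocks used above can be pasted in
    directly. Alternative start codons are not treated specially (assumption).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGGAACAAT ATGATCAAAT TGGCGCAAGA"))  # MEQYDQIGAR
```

The first thirty bases of the gene sequence translate to MEQYDQIGAR, the first block of the protein sequence, and the final nine bases (GTGAATTAA) give VN followed by a stop codon, matching the end of the protein.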