Gene EcolC_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1941 
Symbol 
ID6068542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2144406 
End bp2145620 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content45% 
IMG OID641601352 
Productmajor facilitator transporter 
Protein accessionYP_001724914 
Protein GI170019960 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.923029 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000874845 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAATC CCTATTACCC TACCGCACTT GGGTTGTATT TTAATTACCT GGTGCATGGT 
ATGGGCGTCA TTTTGATGAG CCTGAATATG GCCTCGCTGG AGACACTTTG GCAGACTAAT
GCCGCGGGTG TCTCGATAGT TATCTCATCG CTGGGCATTG GTCGATTAAG TGTCTTGCTT
TTTGCAGGAT TATTATCCGA TCGCTTTGGT CGCCGCCCTT TTATCATGCT CGGGATGTGC
TGCTATATGG CCTTCTTTTT TGGCATCCTG CAGACCAATA ACATCATTAT CGCTTATGTT
TTTGGCTTTC TGGCGGGAAT GGCAAACAGT TTTCTCGATG CAGGCACTTA TCCCAGTTTG
ATGGAAGCTT TTCCACGCTC ACCTGGGACA GCCAATATTT TAATTAAAGC ATTTGTTTCC
AGCGGACAAT TTTTATTACC GCTAATCATT AGCCTGTTAG TGTGGGCTGA ACTGTGGTTC
GGTTGGTCCT TTATGATTGC TGCAGGCATT ATGTTTATTA ACGCTCTGTT TTTATACCGT
TGTACGTTCC CACCCCATCC GGGTCGTCGC TTACCTGTCA TAAAGAAAAC CACCAGCTCT
ACGGAACATC GCTGTTCAAT TATCGATTTA GCCAGTTATA CCTTATATGG CTATATCTCA
ATGGCAACGT TTTATCTGGT TAGCCAGTGG CTGGCACAGT ACGGACAATT TGTTGCAGGC
ATGTCATACA CTATGTCGAT CAAACTACTC AGTATCTACA CCGTGGGTTC GCTGCTTTGT
GTATTTATTA CCGCTCCACT CATTCGTAAT ACCGTTCGCC CAACAACATT ACTGATGCTG
TACACCTTTA TCTCATTTAT TGCTCTGTTT ACCGTCTGCC TGCATCCCAC ATTTTATGTG
GTGATAATAT TTGCTTTTGT CATTGGTTTT ACCTCTGCTG GAGGTGTTGT GCAAATTGGC
CTGACGTTAA TGGCTGAACG TTTCCCTTAC GCTAAAGGTA AAGCTACAGG GATCTATTAC
AGTGCGGGCA GTATTGCGAC CTTTACTATT CCGTTGATTA CGGCTCATCT GTCCCAAAGA
AGTATTGCCG ATATTATGTG GTTCGATACC GCCATCGCTG CCATCGGTTT TTTACTGGCA
CTGTTTATCG GCTTACGCAG CCGCAAAAAA ACGCGGCATC ACTCGCTAAA GGAAAATGTC
GCTCCGGGTG GGTAA
 
Protein sequence
MKNPYYPTAL GLYFNYLVHG MGVILMSLNM ASLETLWQTN AAGVSIVISS LGIGRLSVLL 
FAGLLSDRFG RRPFIMLGMC CYMAFFFGIL QTNNIIIAYV FGFLAGMANS FLDAGTYPSL
MEAFPRSPGT ANILIKAFVS SGQFLLPLII SLLVWAELWF GWSFMIAAGI MFINALFLYR
CTFPPHPGRR LPVIKKTTSS TEHRCSIIDL ASYTLYGYIS MATFYLVSQW LAQYGQFVAG
MSYTMSIKLL SIYTVGSLLC VFITAPLIRN TVRPTTLLML YTFISFIALF TVCLHPTFYV
VIIFAFVIGF TSAGGVVQIG LTLMAERFPY AKGKATGIYY SAGSIATFTI PLITAHLSQR
SIADIMWFDT AIAAIGFLLA LFIGLRSRKK TRHHSLKENV APGG