Gene EcolC_2797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2797 
Symbol 
ID6064949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3058819 
End bp3059946 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content56% 
IMG OID641602203 
Productmajor facilitator transporter 
Protein accessionYP_001725752 
Protein GI170020798 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.569839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.408376 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCCT GGGCAACCCG TACGCCTGCT ATCCGCGATA TTCTCTCTGT CTCGATCGCT 
GAAATGGGCG GTGTTCTCTT TGGTCTGTCG ATCGGTTCGA TGAGCGGTAT TCTCTGCTCG
GCGTGGTTAG TGAAACGCTT TGGGACACGT AATGTCATCC TGGTCACGAT GTCCTGCGCA
TTGATCGGGA TGATGATATT AAGTCTGGCA CTCTGGCTGA CATCGCCCCT GCTCTTTGCC
GTTGGTCTCG GCGTCTTTGG GGCAAGTTTT GGTTCTGCGG AAGTGGCGAT AAACGTTGAA
GGTGCCGCCG TTGAGCGAGA AATGAATAAA ACGGTTTTGC CGATGATGCA CGGTTTTTAT
AGCCTGGGCA CGCTGGCAGG CGCTGGTGTC GGGATGGCAC TGACGGCCTT TGGCGTTCCG
GCAACGGTGC ACATTTTATT GGCGGCGCTG GTAGGTATCG CGCCTATTTA TATCGCCATT
CAGGCAATCC CTGACGGTAC GGGCAAAAAT GCTGCCGATG GCACCCAGCA TGGCGAAAAA
GGCGTACCTT TTTATCGCGA TATCCAGTTG CTGCTGATTG GTGTTGTGGT GCTGGCGATG
GCCTTTGCCG AAGGTTCTGC CAACGACTGG TTACCCTTAT TAATGGTTGA CGGTCACGGT
TTTAGCCCCA CTTCCGGCTC GCTGATTTAT GCCGGTTTTA CCCTGGGGAT GACCGTTGGA
CGCTTCACTG GCGGTTGGTT CATCGACCGT TACAGTCGCG TTGCCGTGGT TCGGGCCAGT
GCGCTAATGG GGGCGTTGGG TATTGGGCTG ATTATTTTTG TCGATAGCGC CTGGGTCGCT
GGGGTGTCTG TTGTACTCTG GGGAATGGGT GCCTCGCTGG GCTTCCCGCT GACCATTTCT
GCCGCCAGCG ATACCGGCCC CGATGCACCA ACCCGCGTCA GTGTGGTAGC AACGACCGGT
TATCTGGCTT TCCTCGTCGG GCCGCCGCTG CTGGGCTATC TCGGCGAACA TTATGGATTA
CGTAGTGCAA TGCTGGTTGT ACTGGCGCTG GTTATTCTCG CGGCTATTGT CGCGAAAGCC
GTCGCCAAAC CCGATACCAA AACGCAGACG GCGATGGAGA ATAGTTGA
 
Protein sequence
MASWATRTPA IRDILSVSIA EMGGVLFGLS IGSMSGILCS AWLVKRFGTR NVILVTMSCA 
LIGMMILSLA LWLTSPLLFA VGLGVFGASF GSAEVAINVE GAAVEREMNK TVLPMMHGFY
SLGTLAGAGV GMALTAFGVP ATVHILLAAL VGIAPIYIAI QAIPDGTGKN AADGTQHGEK
GVPFYRDIQL LLIGVVVLAM AFAEGSANDW LPLLMVDGHG FSPTSGSLIY AGFTLGMTVG
RFTGGWFIDR YSRVAVVRAS ALMGALGIGL IIFVDSAWVA GVSVVLWGMG ASLGFPLTIS
AASDTGPDAP TRVSVVATTG YLAFLVGPPL LGYLGEHYGL RSAMLVVLAL VILAAIVAKA
VAKPDTKTQT AMENS