Gene EcolC_3053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3053 
Symbol 
ID6066134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3333686 
End bp3334936 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content58% 
IMG OID641602469 
Productenterobactin exporter EntS 
Protein accessionYP_001726004 
Protein GI170021050 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAC AATCCTGGCT GCTTAACCTC AGCCTGTTGA AAACGCACCC GGCGTTTCGC 
GCAGTATTCC TCGCTCGTTT TATCTCAATT GTGTCTCTGG GTTTGCTCGG CGTCGCGGTG
CCGGTGCAGA TCCAGATGAT GACGCATTCT ACCTGGCAGG TGGGGCTTTC GGTGACGCTG
ACCGGCGGCG CGATGTTTGT TGGCCTGATG GTTGGCGGTG TGCTGGCGGA TCGCTATGAG
CGCAAAAAAG TGATTTTGCT GGCGCGCGGC ACCTGTGGCA TTGGCTTCAT TGGACTGTGC
CTTAATGCAC TGCTGCCGGA GCCGTCATTG CTGGCAATCT ATTTACTTGG TTTATGGGAT
GGTTTTTTCG CATCGCTTGG CGTTACGGCG CTATTGGCGG CGACACCAGC ACTGGTAGGG
CGTGAAAACT TAATGCAGGC CGGGGCGATC ACCATGTTGA CCGTGCGTCT GGGGTCGGTG
ATTTCGCCCA TGATTGGCGG TTTATTGCTG GCGACCGGTG GCGTAGCCTG GAACTACGGG
CTGGCGGCGG CGGGCACGTT TATTACCTTG CTACCGTTGT TAAGCCTTCC GGCGTTGCCA
CCGCCACCGC AGCCGCGTGA GCATCCGTTG AAATCATTAC TGGCAGGATT TCGTTTTCTG
CTTGCCAGCC CGCTGGTGGG CGGGATTGCG CTGCTGGGTG GTTTATTGAC GATGGCGAGC
GCGGTGCGGG TACTGTATCC GGCGCTGGCT GACAACTGGC AGATGTCAGC GGCACAGATT
GGTTTTCTCT ACGCGGCGAT CCCGCTCGGC GCGGCTATTG GTGCGTTAAC CAGCGGGAAG
CTGGCACATA GTGCGCGACC AGGGTTATTG ATGCTGCTCT CCACGCTGGG ATCGTTCCTC
GCCATTGGTC TGTTTGGCCT GATGCCGATG TGGATTTTAG GCGTGGTTTG TCTGGCGCTG
TTCGGCTGGT TGAGTGCGGT CAGCTCGTTG CTGCAATACA CAATGCTGCA AACGCAAACC
CCGGAAGCGA TGTTAGGGCG GATTAACGGT TTGTGGACGG CGCAGAACGT GACGGGCGAT
GCCATAGGCG CGGCGCTGCT GGGTGGTTTG GGCGCGATGA TGACACCGGT TGCTTCCGCA
AGCGCGAGCG GTTTTGGTTT GTTGATTATC GGCGTGTTGT TATTGCTGGT GCTGGTGGAG
TTGCGACATT TTCGCCAGAC GCCGCCGCAG GTGACAGCGT CCGACAGTTA A
 
Protein sequence
MNKQSWLLNL SLLKTHPAFR AVFLARFISI VSLGLLGVAV PVQIQMMTHS TWQVGLSVTL 
TGGAMFVGLM VGGVLADRYE RKKVILLARG TCGIGFIGLC LNALLPEPSL LAIYLLGLWD
GFFASLGVTA LLAATPALVG RENLMQAGAI TMLTVRLGSV ISPMIGGLLL ATGGVAWNYG
LAAAGTFITL LPLLSLPALP PPPQPREHPL KSLLAGFRFL LASPLVGGIA LLGGLLTMAS
AVRVLYPALA DNWQMSAAQI GFLYAAIPLG AAIGALTSGK LAHSARPGLL MLLSTLGSFL
AIGLFGLMPM WILGVVCLAL FGWLSAVSSL LQYTMLQTQT PEAMLGRING LWTAQNVTGD
AIGAALLGGL GAMMTPVASA SASGFGLLII GVLLLLVLVE LRHFRQTPPQ VTASDS