Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3053 |
Symbol | |
ID | 6066134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3333686 |
End bp | 3334936 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641602469 |
Product | enterobactin exporter EntS |
Protein accession | YP_001726004 |
Protein GI | 170021050 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAC AATCCTGGCT GCTTAACCTC AGCCTGTTGA AAACGCACCC GGCGTTTCGC GCAGTATTCC TCGCTCGTTT TATCTCAATT GTGTCTCTGG GTTTGCTCGG CGTCGCGGTG CCGGTGCAGA TCCAGATGAT GACGCATTCT ACCTGGCAGG TGGGGCTTTC GGTGACGCTG ACCGGCGGCG CGATGTTTGT TGGCCTGATG GTTGGCGGTG TGCTGGCGGA TCGCTATGAG CGCAAAAAAG TGATTTTGCT GGCGCGCGGC ACCTGTGGCA TTGGCTTCAT TGGACTGTGC CTTAATGCAC TGCTGCCGGA GCCGTCATTG CTGGCAATCT ATTTACTTGG TTTATGGGAT GGTTTTTTCG CATCGCTTGG CGTTACGGCG CTATTGGCGG CGACACCAGC ACTGGTAGGG CGTGAAAACT TAATGCAGGC CGGGGCGATC ACCATGTTGA CCGTGCGTCT GGGGTCGGTG ATTTCGCCCA TGATTGGCGG TTTATTGCTG GCGACCGGTG GCGTAGCCTG GAACTACGGG CTGGCGGCGG CGGGCACGTT TATTACCTTG CTACCGTTGT TAAGCCTTCC GGCGTTGCCA CCGCCACCGC AGCCGCGTGA GCATCCGTTG AAATCATTAC TGGCAGGATT TCGTTTTCTG CTTGCCAGCC CGCTGGTGGG CGGGATTGCG CTGCTGGGTG GTTTATTGAC GATGGCGAGC GCGGTGCGGG TACTGTATCC GGCGCTGGCT GACAACTGGC AGATGTCAGC GGCACAGATT GGTTTTCTCT ACGCGGCGAT CCCGCTCGGC GCGGCTATTG GTGCGTTAAC CAGCGGGAAG CTGGCACATA GTGCGCGACC AGGGTTATTG ATGCTGCTCT CCACGCTGGG ATCGTTCCTC GCCATTGGTC TGTTTGGCCT GATGCCGATG TGGATTTTAG GCGTGGTTTG TCTGGCGCTG TTCGGCTGGT TGAGTGCGGT CAGCTCGTTG CTGCAATACA CAATGCTGCA AACGCAAACC CCGGAAGCGA TGTTAGGGCG GATTAACGGT TTGTGGACGG CGCAGAACGT GACGGGCGAT GCCATAGGCG CGGCGCTGCT GGGTGGTTTG GGCGCGATGA TGACACCGGT TGCTTCCGCA AGCGCGAGCG GTTTTGGTTT GTTGATTATC GGCGTGTTGT TATTGCTGGT GCTGGTGGAG TTGCGACATT TTCGCCAGAC GCCGCCGCAG GTGACAGCGT CCGACAGTTA A
|
Protein sequence | MNKQSWLLNL SLLKTHPAFR AVFLARFISI VSLGLLGVAV PVQIQMMTHS TWQVGLSVTL TGGAMFVGLM VGGVLADRYE RKKVILLARG TCGIGFIGLC LNALLPEPSL LAIYLLGLWD GFFASLGVTA LLAATPALVG RENLMQAGAI TMLTVRLGSV ISPMIGGLLL ATGGVAWNYG LAAAGTFITL LPLLSLPALP PPPQPREHPL KSLLAGFRFL LASPLVGGIA LLGGLLTMAS AVRVLYPALA DNWQMSAAQI GFLYAAIPLG AAIGALTSGK LAHSARPGLL MLLSTLGSFL AIGLFGLMPM WILGVVCLAL FGWLSAVSSL LQYTMLQTQT PEAMLGRING LWTAQNVTGD AIGAALLGGL GAMMTPVASA SASGFGLLII GVLLLLVLVE LRHFRQTPPQ VTASDS
|
| |