Gene Information |
Locus tag | EcolC_0243 |
Symbol | |
ID | 6067757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 281685 |
End bp | 282944 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641599642 |
Product | major facilitator superfamily transporter |
Protein accession | YP_001723249 |
Protein GI | 170018295 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAAA TGAAACACTG TTGTAAAAAT GTGGTGATCC TCATGCCCGA ACCCGTAGCC GAACCCGCGC TAAACGGATT GCGCCTGAAT TTGCGCATTG TCTCCATTGT CATGTTTAAC TTCGCCAGCT ACCTCACCAT CGGGTTGCCG CTCGCTGTAT TACCGGGCTA TGTCCATGAT GTGATGGGCT TTAGCGCTTT CTGGGCAGGA TTGGTTATCA GCCTGCAATA TTTCGCCACC TTGCTGAGCC GCCCTCATGC CGGACGTTAT GCCGATTTGC TGGGACCCAA AAAGATTGTC GTCTTCGGTT TATGCGGCTG CTTTTTGAGC GGTCTGGGAT ATCTGACGGC AGGATTAACC GCCAGTCTGC CCGTCATCAG CCTGTTATTA CTTTGCCTGG GACGCGTGAT CCTTGGGATT GGGCAAAGTT TTGCCGGAAC GGGATCGACC CTGTGGGGCG TTGGCGTGGT TGGCTCGCTG CATATCGGGC GGGTGATTTC GTGGAACGGC ATTGTCACTT ACGGGGCGAT GGCGATGGGT GCGCCGTTAG GCGTCGTGTT TTATCACTGG GGCGGCTTGC AGGCGTTAGC GTTAATCATT ATGGGCGTGG CGCTGGTGGC CATTTTGTTG GCGCTCCCGC GTCCGACGGT AAAAGCCAGT AAAGGCAAAC CGCTGCCGTT TCGCGCGGTG CTTGGGCGCG TCTGGCTGTA CGGTATGGCA CTGGCACTGG CTTCCGCCGG ATTTGGCGTT ATCGCCACCT TTATCACGCT GTTTTATGAC GCTAAAGGTT GGGACGGTGC GGCTTTCGCG CTGACGCTGT TTAGCTGTGC GTTTGTCGGT ACGCGTTTGT TATTCCCTAA CGGCATTAAC CGTATCGGCG GCTTAAACGT GGCGATGATT TGCTTTAGCG TTGAGATAAT CGGCCTGCTA CTGGTTGGCG TGGCGACTAT GCCGTGGATG GCGAAAATCG GCGTCTTACT GGCGGGGGCA GGGTTTTCGC TGGTGTTCCC GGCATTGGGT GTAGTGGCGG TAAAAGCGGT TCCGCAGCAA AATCAGGGGG CGGCGCTGGC AACTTACACC GTATTTATGG ATTTATCGCT TGGCGTGACT GGACCACTGG CTGGGCTGGT GATGAGCTGG GCGGGCGTAC CGGTGATTTA TCTGGCGGCG GCGGGACTGG TCGCAATCGC GTTATTACTG ACGTGGCGAT TAAAAAAACG GCCTCCGGAA CACGTCCCTG AGGCCGCCTC ATCATCTTAA
|
Protein sequence | MVKMKHCCKN VVILMPEPVA EPALNGLRLN LRIVSIVMFN FASYLTIGLP LAVLPGYVHD VMGFSAFWAG LVISLQYFAT LLSRPHAGRY ADLLGPKKIV VFGLCGCFLS GLGYLTAGLT ASLPVISLLL LCLGRVILGI GQSFAGTGST LWGVGVVGSL HIGRVISWNG IVTYGAMAMG APLGVVFYHW GGLQALALII MGVALVAILL ALPRPTVKAS KGKPLPFRAV LGRVWLYGMA LALASAGFGV IATFITLFYD AKGWDGAAFA LTLFSCAFVG TRLLFPNGIN RIGGLNVAMI CFSVEIIGLL LVGVATMPWM AKIGVLLAGA GFSLVFPALG VVAVKAVPQQ NQGAALATYT VFMDLSLGVT GPLAGLVMSW AGVPVIYLAA AGLVAIALLL TWRLKKRPPE HVPEAASSS
|
| |
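The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends with a stop codon that is not counted in the protein length. The function names (`gene_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers for illustration and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values copied from the record above.
    length = gene_length(281685, 282944)
    print("Gene length (bp):", length)
    print("Expected protein length (aa):", expected_protein_length(length))
```

To compare against the GC content field, pass the space-separated gene sequence string from the record to `gc_percent`; the database may round the result, so an exact match is not guaranteed.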