Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3162 |
Symbol | araE |
ID | 5588916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3177457 |
End bp | 3178875 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640926804 |
Product | arabinose-proton symporter |
Protein accession | YP_001464177 |
Protein GI | 157159072 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.839771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACTA TCAATACGGA ATCTGCTTTA ACGCCACGTT CTTTGCGTGA TACGCGGCGT ATGAATATGT TTGTTTCGGT AGCTGCTGCG GTCGCAGGAT TGTTATTTGG TCTTGATATC GGCGTAATCG CCGGAGCGTT GCCGTTCATT ACCGATCACT TTGTGCTGAC CAGTCGTTTG CAGGAATGGG TGGTTAGTAG CATGATGCTC GGCGCAGCAA TCGGTGCGCT GTTTAATGGT TGGCTGTCGT TCCGCCTGGG GCGTAAATAC AGCCTGATGG CGGGGGCCAT CCTGTTTGTA CTCGGTTCTA TAGGGTCCGC TTTTGCGACC AGCGTAGAGA TGTTAATCGC CGCTCGTGTG GTGCTGGGCA TTGCTGTCGG GATCGCGTCT TACACCGCTC CTCTGTATCT TTCTGAAATG GCAAGTGAAA ACGTTCGCGG TAAGATGATC AGTATGTACC AGTTGATGGT CACACTCGGC ATCGTGCTGG CGTTTTTATC CGATACAGCG TTCAGTTATA GCGGTAACTG GCGCGCAATG TTGGGGGTTC TTGCTTTACC AGCAGTCCTG CTGATTATTC TGGTGGTCTT CCTGCCAAAT AGCCCGCGCT GGCTGGCGGA AAAGGGGCGT CATATTGAGG CGGAAGAAGT GTTGCGTATG CTGCGCGATA CGTCGGAAAA AGCGCGAGAA GAACTCAACG AAATTCGTGA AAGCCTGAAG TTAAAACAAG GCGGTTGGGC ACTGTTTAAG ATCAACCGTA ACGTCCGTCG TGCTGTGTTT CTCGGTATGT TGTTGCAGGC GATGCAGCAG TTTACCGGTA TGAACATCAT CATGTACTAT GCGCCGCGTA TCTTCAAAAT GGCGGGCTTT ACGACCACAG AACAACAGAT GATTGCGACT CTGGTCGTGG GACTGACCTT TATGTTCGCG ACCTTCATTG CGGTCTTTAC GGTAGATAAA GCAGGTCGTA AACCGGCTCT GAAAATTGGT TTCAGCGTGA TGGCGTTAGG CACTCTGGTG CTGGGCTATT GCCTGATGCA GTTTGATAAC GGTACGGCTT CCAGTGGCTT GTCCTGGCTC TCTGTTGGCA TGACGATGAT GTGTATTGCC GGTTATGCGA TGAGCGCCGC GCCAGTGGTG TGGATCCTGT GCTCTGAAAT TCAGCCGCTG AAATGCCGCG ATTTCGGTAT TACCTGTTCG ACGACGACAA ACTGGGTGTC GAATATGATT ATCGGCGCGA CCTTCCTGAC ACTGCTTGAT AGCATTGGCG CTGCCGGTAC GTTCTGGCTC TACACTGCGC TGAACATTGC GTTTGTGGGC ATCACTTTCT GGCTCATTCC GGAAACCAAA AATGTCACGC TGGAACATAT CGAACGCAAA CTGATGGCAG GCGAGAAGTT GAGAAATATC GGCGTCTGA
|
Protein sequence | MVTINTESAL TPRSLRDTRR MNMFVSVAAA VAGLLFGLDI GVIAGALPFI TDHFVLTSRL QEWVVSSMML GAAIGALFNG WLSFRLGRKY SLMAGAILFV LGSIGSAFAT SVEMLIAARV VLGIAVGIAS YTAPLYLSEM ASENVRGKMI SMYQLMVTLG IVLAFLSDTA FSYSGNWRAM LGVLALPAVL LIILVVFLPN SPRWLAEKGR HIEAEEVLRM LRDTSEKARE ELNEIRESLK LKQGGWALFK INRNVRRAVF LGMLLQAMQQ FTGMNIIMYY APRIFKMAGF TTTEQQMIAT LVVGLTFMFA TFIAVFTVDK AGRKPALKIG FSVMALGTLV LGYCLMQFDN GTASSGLSWL SVGMTMMCIA GYAMSAAPVV WILCSEIQPL KCRDFGITCS TTTNWVSNMI IGATFLTLLD SIGAAGTFWL YTALNIAFVG ITFWLIPETK NVTLEHIERK LMAGEKLRNI GV
|
| |