Gene Information |
Locus tag | EcSMS35_2989 |
Symbol | araE |
ID | 6145568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3069397 |
End bp | 3070815 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617858 |
Product | arabinose-proton symporter |
Protein accession | YP_001745010 |
Protein GI | 170681217 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00879] MFS transporter, sugar porter (SP) family |
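
The length fields in this record can be cross-checked against its coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not say so, but the stated gene length is consistent with it) and that the terminal stop codon is counted in the gene length but not in the protein length. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 3069397
END_BP = 3070815


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based interval."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1419
print(protein_length(length_bp))  # 472
```

Both values agree with the Gene Length (1419 bp) and Protein Length (472 aa) fields listed above.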
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0609314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTTACTA TCAATACGGA ATCTGCTTTA ACGCCACGTT CTTTGCGGGA TACGCGGCGT ATGAATATGT TTGTTTCGGT AGCTGCTGCG GTCGCAGGAT TGTTATTTGG TCTTGATATC GGCGTAATCG CCGGAGCCTT GCCGTTCATT ACCGATCACT TTGTGCTGAC CAGCCGTTTG CAGGAATGGG TGGTCAGTAG CATGATGCTC GGCGCAGCAA TTGGTGCGCT GTTTAATGGT TGGCTGTCGT TCCGTCTGGG GCGTAAATAC AGCCTAATGG CGGGGGCCAT CCTGTTTGTG CTCGGCTCGA TAGGGTCCGC TTTTGCGACC AGCGTAGAGA TGTTAATCGC CGCTCGTGTG GTGCTGGGCA TTGCTGTCGG GATCGCGTCT TACACCGCTC CTCTGTATCT TTCTGAAATG GCAAGTGAAA ACGTTCGCGG TAAGATGATC AGTATGTACC AGTTGATGGT CACACTCGGC ATCGTGCTGG CGTTTTTATC CGATACAGCG TTCAGTTATA GCGGTAACTG GCGCGCAATG TTGGGGGTTC TTGCTTTACC AGCAGTCCTG CTGATTATTC TGGTGGTCTT CCTGCCAAAT AGCCCGCGCT GGCTCGCGGA AAAAGGGCGT CATATTGAGG CGGAAGAAGT GTTGCGTATG CTGCGCGATA CGTCGGAAAA AGCGCGAGAA GAACTCAACG AAATTCGTGA AAGCCTGAAG TTAAAACAGG GCGGTTGGGC ACTGTTTAAG ATCAACCGTA ACGTCCGTCG TGCTGTGTTT CTCGGTATGT TGTTGCAGGC GATGCAGCAG TTTACCGGTA TGAACATCAT CATGTACTAC GCGCCGCGTA TCTTCAAAAT GGCGGGCTTT ACGACCACAG AACAACAGAT GATTGCGACT CTGGTCGTGG GGCTTACCTT TATGTTCGCC ACCTTCATTG CGGTCTTTAC GGTAGATAAA GCAGGGCGTA AACCGGCTCT GAAAATTGGT TTCAGCGTGA TGGCGTTAGG CACTCTGGTG CTGGGCTATT GCCTGATGCA GTTTGATAAC GGTACGGCTT CCAGTGGCTT GTCCTGGCTC TCTGTTGGCA TGACGATGAT GTGTATTGCC GGTTATGCGA TGAGCGCCGC GCCAGTGGTG TGGATCCTGT GCTCTGAGAT TCAGCCGCTG AAATGCCGCG ATTTTGGCAT CACCTGTTCG ACGACGACAA ACTGGGTGTC GAATATGATT ATCGGCGCGA CCTTCCTGAC ACTGCTTGAT AGCATTGGCG CTGCCGGTAC GTTCTGGCTC TACACTGGGC TGAACATTGC GTTTGTGGGC ATTACTTTCT GGCTCATTCC GGAAACCAAA AATGTCACGC TGGAACATAT CGAACGCAAA CTGATGGCAG GCGAGAAGTT GAGAAATATC GGCGTCTGA
Protein sequence | MVTINTESAL TPRSLRDTRR MNMFVSVAAA VAGLLFGLDI GVIAGALPFI TDHFVLTSRL QEWVVSSMML GAAIGALFNG WLSFRLGRKY SLMAGAILFV LGSIGSAFAT SVEMLIAARV VLGIAVGIAS YTAPLYLSEM ASENVRGKMI SMYQLMVTLG IVLAFLSDTA FSYSGNWRAM LGVLALPAVL LIILVVFLPN SPRWLAEKGR HIEAEEVLRM LRDTSEKARE ELNEIRESLK LKQGGWALFK INRNVRRAVF LGMLLQAMQQ FTGMNIIMYY APRIFKMAGF TTTEQQMIAT LVVGLTFMFA TFIAVFTVDK AGRKPALKIG FSVMALGTLV LGYCLMQFDN GTASSGLSWL SVGMTMMCIA GYAMSAAPVV WILCSEIQPL KCRDFGITCS TTTNWVSNMI IGATFLTLLD SIGAAGTFWL YTGLNIAFVG ITFWLIPETK NVTLEHIERK LMAGEKLRNI GV
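
As a small illustration of how the two sequences relate, the sketch below translates the first and last few codons of the gene sequence shown above. It assumes the gene sequence is printed 5' to 3' in the coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the amino acid assignments of translation table 11, which match the standard genetic code for sense codons. The helper name `translate` is hypothetical, and no external library is used.

```python
# Codon table built from the conventional TCAG ordering.
# Translation table 11 shares these codon-to-amino-acid
# assignments with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGTTACTA TCAATACGGA ATCTGCTTTA"))  # MVTINTESAL
print(translate("GGCGTCTGA"))                         # GV
```

The two outputs correspond to the first ten and last two residues of the protein sequence listed above; the final TGA codon is the stop and is not translated.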