Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_02689 |
Symbol | araE |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | - |
Start bp | 2819046 |
End bp | 2820464 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | arabinose transporter |
Protein accession | ACT44505 |
Protein GI | 253978835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACTA TCAATACGGA ATCTGCTTTA ACGCCACGTT CTTTGCGGGA TACGCGGCGT ATGAATATGT TTGTTTCGGT AGCTGCTGCG GTCGCAGGAT TGTTATTTGG TCTTGATATC GGCGTAATCG CCGGAGCGTT GCCGTTCATT ACCGATCACT TTGTGCTGAC CAGTCGTTTG CAGGAATGGG TGGTTAGTAG CATGATGCTC GGTGCAGCAA TTGGTGCGCT GTTTAATGGT TGGCTGTCGT TCCGCCTGGG GCGTAAATAC AGCCTGATGG CGGGGGCCAT CCTGTTTGTA CTCGGTTCTA TAGGGTCCGC TTTTGCGACC AGCGTAGAGA TGTTAATCGC CGCTCGTGTG GTGCTGGGCA TTGCTGTCGG GATCGCGTCT TACACCGCTC CTCTGTATCT TTCTGAAATG GCAAGTGAAA ACGTTCGCGG TAAGATGATC AGTATGTACC AGTTGATGGT CACACTCGGC ATCGTGCTGG CGTTTTTATC CGATACAGCG TTCAGTTATA GCGGTAACTG GCGCGCAATG TTGGGGGTTC TTGCTTTACC AGCAGTCCTG CTGATTATTC TGGTGGTCTT CCTGCCAAAT AGCCCGCGCT GGCTAGCGGA AAAGGGGCGT CATATTGAGG CGGAAGAAGT GTTGCGTATG CTGCGCGATA CGTCGGAAAA AGCGCGAGAA GAACTCAACG AAATTCGTGA AAGCCTGAAG TTAAAACAGG GCGGTTGGGC ACTGTTTAAG ATCAACCGTA ACGTCCGTCG TGCTGTGTTT CTCGGTATGT TGTTGCAGGC GATGCAGCAG TTTACCGGTA TGAACATCAT CATGTACTAC GCGCCGCGTA TCTTCAAAAT GGCGGGCTTT ACGACCACAG AACAACAGAT GATTGCGACT CTGGTCGTGG GACTGACCTT TATGTTCGCG ACCTTCATTG CGGTCTTTAC GGTAGATAAA GCAGGTCGTA AACCGGCTCT GAAAATTGGT TTCAGCGTGA TGGCGTTAGG CACTCTGGTG CTGGGCTATT GCCTGATGCA GTTTGATAAC GGTACGGCTT CCAGTGGCTT GTCCTGGCTC TCTGTTGGCA TGACGATGAT GTGTATTGCC GGTTATGCGA TGAGCGCCGC GCCAGTGGTG TGGATCCTGT GCTCTGAAAT TCAGCCGCTG AAATGCCGCG ATTTCGGTAT TACCTGTTCG ACCACCACGA ACTGGGTGTC GAATATGATT ATCGGCGCGA CCTTCCTGAC GCTGCTCGAC AGCATTGGCG CTGCCGGTAC GTTCTGGCTC TACACTGCGC TGAACATTGC GTTTGTGTGC ATCACTTTCT GGCTCATTCC GGAAACCAAA AATGTCACCC TGGAACATAT CGAACGCAAA CTGATGGCAG GCGAGAAGTT GAGAAATATC GGCGTCTGA
|
Protein sequence | MVTINTESAL TPRSLRDTRR MNMFVSVAAA VAGLLFGLDI GVIAGALPFI TDHFVLTSRL QEWVVSSMML GAAIGALFNG WLSFRLGRKY SLMAGAILFV LGSIGSAFAT SVEMLIAARV VLGIAVGIAS YTAPLYLSEM ASENVRGKMI SMYQLMVTLG IVLAFLSDTA FSYSGNWRAM LGVLALPAVL LIILVVFLPN SPRWLAEKGR HIEAEEVLRM LRDTSEKARE ELNEIRESLK LKQGGWALFK INRNVRRAVF LGMLLQAMQQ FTGMNIIMYY APRIFKMAGF TTTEQQMIAT LVVGLTFMFA TFIAVFTVDK AGRKPALKIG FSVMALGTLV LGYCLMQFDN GTASSGLSWL SVGMTMMCIA GYAMSAAPVV WILCSEIQPL KCRDFGITCS TTTNWVSNMI IGATFLTLLD SIGAAGTFWL YTALNIAFVC ITFWLIPETK NVTLEHIERK LMAGEKLRNI GV
|
| |