Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0425 |
Symbol | araJ |
ID | 6146639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 433575 |
End bp | 434843 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615321 |
Product | MFS transport protein AraJ |
Protein accession | YP_001742528 |
Protein GI | 170683106 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.87857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCTCGC TGGTCGTTAT CCTGCAAGCT ATCACTTTAT TGGCTACGGT GATTAGTAGC CGTTCTGGTG GTTGTGATGG TGGTATGAAA AAAGTCATTT TATCTTTGGC TCTGGGCACG TTTGGTTTGG GGATGGCCGA ATTTGGCATT ATGGGCGTGC TCACGGAGCT GGCGCATAAC GTAGGAATTT CGATTCCTGC TGCCGGGCAT ATGATCTCGT ATTATGCGCT GGGGGTGGTG GTCGGTGCGC CAATCATCGC ACTCTTTTCC AGCCGCTACT CACTCAAACA TATCTTATTG TTTCTGGTGG CGTTGTGCGT CATTGGCAAC GCCATGTTCA CGCTCTCTTC GTCTTACCTG ATGCTCGCCA TTGGTCGGCT GGTATCCGGC TTTCCGCATG GCGCATTTTT TGGCGTCGGC GCGATCGTGT TATCAAAAAT TATCAAACCC GGAAAAGTCA CCGCCGCCGT GGCGGGGATG GTTTCCGGGA TGACAGTCGC CAATTTGCTG GGCATTCCGC TGGGAACGTA TTTAAGTCAG GAATTTAGCT GGCGTTACAC CTTTTTATTG ATCGCTGTTT TTAATATTGT GGTGATGGCA TCGGTCTATT TTTGGGTGCC GGATATTCGC GACGAAGCGA AAGGAAAGCT GCGCGAACAA TTTCACTTTT TACGCAGCCC GGCCCCGTGG TTAATTTTCG CCGCCACCAT GTTTGGCAAC GCAGGTGTAT TTGCCTGGTT CAGCTACGTA AAGCCATACA TGATGTTTAT TTCCGGTTTT TCGGAAACGG CGATGACCTT TATTATGATG TTAGTGGGGC TAGGGATGGT GCTGGGGAAT GTGCTAAGTG GCCGAATTTC AGGACGTTAT TCACCACTGC GCATTGCAGC AGTGACTGAC TTTATCATTG TACTGGCACT GCTGATGCTC TTTTTCTGCG GCGGCATGAA AATAACGTCG CTTATTTTTG CTTTTATTTG TTGCGCGGGA TTATTTGCCC TTTCAGCACC TCTGCAAATA TTGTTACTGC AAAACGCCAA AGGCGGAGAG TTATTAGGTG CCGCAGGTGG GCAAATAGCG TTTAACCTCG GTAGCGCCGT CGGCGCATAT TGCGGTGGTA TGATGCTGAC GCTGGGGCTG GCATATAATT ACGTGGCGCT GCCTGCCGCC CTGCTTTCGT TTGCTGCGAT GTCGTCGTTG CTGCTGTATG GTCGCTATAA GCGCCAGCAA GCGGCGGATA GTCCGGTGCT GGCGAAACCA CTGGGGTAG
|
Protein sequence | MASLVVILQA ITLLATVISS RSGGCDGGMK KVILSLALGT FGLGMAEFGI MGVLTELAHN VGISIPAAGH MISYYALGVV VGAPIIALFS SRYSLKHILL FLVALCVIGN AMFTLSSSYL MLAIGRLVSG FPHGAFFGVG AIVLSKIIKP GKVTAAVAGM VSGMTVANLL GIPLGTYLSQ EFSWRYTFLL IAVFNIVVMA SVYFWVPDIR DEAKGKLREQ FHFLRSPAPW LIFAATMFGN AGVFAWFSYV KPYMMFISGF SETAMTFIMM LVGLGMVLGN VLSGRISGRY SPLRIAAVTD FIIVLALLML FFCGGMKITS LIFAFICCAG LFALSAPLQI LLLQNAKGGE LLGAAGGQIA FNLGSAVGAY CGGMMLTLGL AYNYVALPAA LLSFAAMSSL LLYGRYKRQQ AADSPVLAKP LG
|
| |