Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0315 |
Symbol | araJ |
ID | 6271598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 304793 |
End bp | 306061 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641724554 |
Product | MFS transport protein AraJ |
Protein accession | YP_001879104 |
Protein GI | 187732035 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCCTTGC TGGTCGTTAT CCTGCAAGCT ATCACTTTAT TGGCTACGGT GATTGGTAGC CGTTCTGGTG GTTGTGATGA TGGTATGAAA AAAGTCATTT TATCTTTGGC TCTGGGCACG TTTGGTTTGG GGATGGCCGA ATTTGGCATT ATGGGCGTGC TCACGGAGCT GGCGCATAAC GTAGGAATTT CGATTCCTGC TGCCGGGCAT ATGATCTCGT ATTATGCACT GGGGGTGGTG GTCGGTGCGC CAATCATCGC ACTCTTTTCC AGCCGCTACT CACTCAAACA TATCTTGTTG TTTCTGGTGG CGTTGTGCGT CATTGGCAAC GCCATGTTCA CGCTCTCTTC GTCTTACCTG ATGCTCGCCA TTGGTCGGCT GGTATCCGGC TTTCCGCATG GCGCATTTTT TGGCGTCGGC GCGATCGTGT TATCAAAAAT TATCAAACCC GGAAAAGTCA CCGCCGCCGT GGCGGGGATG GTTTCCGGGA TGACAGTCGC CAATTTGCTG GGCATTCCGC TGGGAACGTA TTTAAGTCAG GAATTTAGCT GGCGTTACAC CTTTTTATTG ATCGCTGTTT TTAATATTGC GGTGATGGCA TCGGTCTATT TTTGGGTGCC GGATATTCGC GACGAGGCGA AAGGAAAGCT GCGCGAACAA TTTCACTTTT TGCGCAGCCC GGCCCCGTGG TTAATTTTCG CCGCCACCAT GTTTGGCAAC GCAGGTGTGT TTGCCTGGTT CAGCTACGTA AAGCCATACA TGATGTTTAT TTCCGGTTTT TCGGAAACGG CGATGACCTT TATTATGATG TTAGTTGGGC TAGGGATGGT GCTGGGAAAT ATGCTAAGTG GCAGGATTTC AGGACGTTAT TCACCACTGC GCATTGCAGC AGTGACTGAC TTTATAATTG TACTGGCACT GCTGATGCTC TTTTTCTGCG GCGGCATGAA AACAACGTCG CTTATTTTTG CTTTTATTTG TTGCGCGGGA TTATTTGCCC TTTCAGCACC GCTACAAATA TTGTTACTAC AAAACGCCAA AGGCGGAGAG TTATTAGGTG CCGCAGGTGG GCAAATAGCG TTTAACCTCG GTAGCGCCGT CGGCGCATAT TGCGGTGGTA TGATGCTGAC GCTGGGGCTG GCATATAATT ACGTGGCGCT GCCTGCCGCC CTGCTTTCGT TTGCTGCGAT GTCGTCGTTG CTGCTGTATG GTCGCTATAA GCGCCAGCAA GCGGCGGATA CTCCGGTGCT GGCGAAACCA CTGGGGTAG
|
Protein sequence | MALLVVILQA ITLLATVIGS RSGGCDDGMK KVILSLALGT FGLGMAEFGI MGVLTELAHN VGISIPAAGH MISYYALGVV VGAPIIALFS SRYSLKHILL FLVALCVIGN AMFTLSSSYL MLAIGRLVSG FPHGAFFGVG AIVLSKIIKP GKVTAAVAGM VSGMTVANLL GIPLGTYLSQ EFSWRYTFLL IAVFNIAVMA SVYFWVPDIR DEAKGKLREQ FHFLRSPAPW LIFAATMFGN AGVFAWFSYV KPYMMFISGF SETAMTFIMM LVGLGMVLGN MLSGRISGRY SPLRIAAVTD FIIVLALLML FFCGGMKTTS LIFAFICCAG LFALSAPLQI LLLQNAKGGE LLGAAGGQIA FNLGSAVGAY CGGMMLTLGL AYNYVALPAA LLSFAAMSSL LLYGRYKRQQ AADTPVLAKP LG
|
| |