Gene Information |
Locus tag | SbBS512_E1023 |
Symbol | araH |
ID | 6268609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 944907 |
End bp | 945893 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641725166 |
Product | L-arabinose transporter permease protein |
Protein accession | YP_001879688 |
Protein GI | 187732756 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCTG TTTCTACATC GGGTTCTGGC GCACCTAAGT CGTCATTCAG CTTCGGGCGT ATCTGGGATC AGTACGGCAT GCTGGTGGTG TTTGCGGTGC TCTTTATCGC CTGTGCCATT TTTGTCCCAA ATTTTGCCAC CTTCATTAAT ATGAAAGGGT TGGGCCTGGC AATTTCCATG TCGGGGATGG TGGCGTGTGG CATGTTGTTC TGCCTTGCTT CCGGTGACTT TGACCTTTCT GTCGCCTCCG TAATTGCCTG TGCGGGTGTC ACCACGGCGG TGGTTATCAA CCTGACTGAA AGCCTGTGGA TTGGCGTGGC AGCGGGGTTG CTGCTTGGCA TTCTCTGTGG CCTGGTCAAT GGCTTTGTTA TCGCCAAACT GAAAATAAAT GCTCTGATCA CAACACTGGC AACGATGCAG ATTGTTCGAG GTCTGGCGTA CATCATTTCA GACGGTAAAG CGGTCGGTAT CGAAGATGAA AGCTTCTTTG CCCTTGGTTA CGCTAACTGG TTCGGTCTGC CTGCGCCAAT CTGGCTCACC GTCGCGTGTC TGATTATCTT TGGTTTGTTG CTGAATAAAA CCACCTTTGG TCGTAACACC CTGGCGATTG GCGGGAACGA AGAGGCTGCG CGTCTGGCGG GTGTACCGGT TGTTCGCACC AAAATTATTA TCTTTGTTCT CTCTGGCCTG GTATCTGCGA TAGCCGGAAT TATTCTGGCT TCACGTATGA CCAGTGGGCA GCCAATGACG TCGATTGGTT ACGAGCTGAT TGTTATCTCC GCCTGCGTTT TAGGTGGCGT TTCTCTGAAA GGAGGCATCG GAAAAATCTC ATATGTGGTG GCGGGTATCT TAATTTTAGG CACCGTGGAA AACGCCATGA ACCTGCTTAA TATTTCTCCT TTCGCGCAGT ACGTGGTTCG CGGCTTAATC CTGCTGGCAG CGGTGATCTT CGACCGTTAC AAGCAAAAAG CGAAACGCAC TGTCTGA
|
Protein sequence | MSSVSTSGSG APKSSFSFGR IWDQYGMLVV FAVLFIACAI FVPNFATFIN MKGLGLAISM SGMVACGMLF CLASGDFDLS VASVIACAGV TTAVVINLTE SLWIGVAAGL LLGILCGLVN GFVIAKLKIN ALITTLATMQ IVRGLAYIIS DGKAVGIEDE SFFALGYANW FGLPAPIWLT VACLIIFGLL LNKTTFGRNT LAIGGNEEAA RLAGVPVVRT KIIIFVLSGL VSAIAGIILA SRMTSGQPMT SIGYELIVIS ACVLGGVSLK GGIGKISYVV AGILILGTVE NAMNLLNISP FAQYVVRGLI LLAAVIFDRY KQKAKRTV
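The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates, and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence above ends in TGA, a stop codon in translation table 11). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(944907, 945893)
print(length)                  # 987, matching the Gene Length field
print(protein_length(length))  # 328, matching the Protein Length field
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields listed above.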