Gene Information |
Locus tag | ECH74115_2635 |
Symbol | araH |
ID | 6968475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2490000 |
End bp | 2490986 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643386498 |
Product | L-arabinose transporter permease protein |
Protein accession | YP_002270980 |
Protein GI | 209396335 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components |
TIGRFAM ID | |
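The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are inclusive and that the gene length counts one stop codon; the record does not state either convention. The function names `gene_length` and `protein_length` are hypothetical and exist only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2490000, 2490986)
print(length, protein_length(length))  # prints "987 328"
```

Under those assumptions the result agrees with the Gene Length (987 bp) and Protein Length (328 aa) fields.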
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.348948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 4.85021e-12 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTTCTG TTTCTACATC GGGTTCTGGC GCACCTAAGT CGTCATTCAG CTTCGGGCGT ATCTGGGATC AGTACGGCAT GCTGGTGGTG TTTGCGGTGC TTTTTATCGC CTGTGCCATT TTTGTCCCGA ACTTTGCCAC CTTTATTAAT ATGAAAGGGT TGGGCCTGGC AATTTCCATG TCGGGGATGG TGGCGTGTGG CATGTTGTTC TGCCTTGCTT CCGGTGACTT TGACCTTTCT GTCGCCTCCG TAATTGCCTG TGCGGGTGTC ACCACGGCGG TGGTTATCAA TCTGACTGAA AGCCTGTGGA TTGGCGTGGC AGCGGGGTTG CTGCTTGGCA TTCTCTGTGG CCTGGTCAAT GGCTTTGTTA TCGCCAAACT GAAAATAAAT GCTCTGATCA CGACACTGGC AACGATGCAG ATTGTTCGAG GTCTGGCGTA CATCATTTCC GACGGTAAAG CGGTCGGTAT CGAAGATGAA AGCTTCTTTG CCCTTGGTTA CGCCAACTGG TTCGGTCTGC CTGCGCCAAT CTGGCTCACC GTCGCGTGTC TGATTATCTT TGGTTTGCTG CTGAATAAAA CTACCTTTGG TCGTAATACC CTGGCGATTG GCGGGAACGA AGAGGCTGCG CGTCTGGCGG GTGTACCGGT TGCTCGCACC AAAATTATTA TCTTTGTTCT CTCTGGCCTG GTATCTGCGA TAGCCGGAAT TATTCTGGCT TCACGTATGA CCAGTGGGCA GCCAATGACG TCGATTGGTT ATGAGCTTAT TGTTATCTCC GCCTGCGTTT TAGGTGGCGT TTCTCTGAAA GGTGGCATCG GAAAAATCTC ATATGTGGTG GCGGGTATCT TAATTTTAGG CACCGTGGAA AACGCCATGA ACCTGCTTAA TATTTCTCCT TTCGCGCAGT ACGTGGTTCG CGGCTTAATC CTGCTGGCAG CGGTGATCTT CGACCGTTAC AAGCAAAAAG CGAAACGCAC TGTCTGA
|
Protein sequence | MSSVSTSGSG APKSSFSFGR IWDQYGMLVV FAVLFIACAI FVPNFATFIN MKGLGLAISM SGMVACGMLF CLASGDFDLS VASVIACAGV TTAVVINLTE SLWIGVAAGL LLGILCGLVN GFVIAKLKIN ALITTLATMQ IVRGLAYIIS DGKAVGIEDE SFFALGYANW FGLPAPIWLT VACLIIFGLL LNKTTFGRNT LAIGGNEEAA RLAGVPVART KIIIFVLSGL VSAIAGIILA SRMTSGQPMT SIGYELIVIS ACVLGGVSLK GGIGKISYVV AGILILGTVE NAMNLLNISP FAQYVVRGLI LLAAVIFDRY KQKAKRTV
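The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any computation. As a minimal sketch, the GC fraction of a block-formatted sequence could be computed as follows. The helper name `gc_content` is hypothetical, and the sample is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence, used as a short sample.
sample = "ATGTCTTCTG TTTCTACATC"
print(f"{gc_content(sample):.2f}")
```

Passing the full gene sequence string in place of `sample` would give a value to compare against the GC content field of the record.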