Gene Information |
Locus tag | ECH74115_2637 |
Symbol | araF |
ID | 6969060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2492585 |
End bp | 2493574 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643386500 |
Product | L-arabinose ABC transporter, periplasmic L-arabinose-binding protein |
Protein accession | YP_002270982 |
Protein GI | 209399828 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000000000000871652 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCACAAAT TTACTAAAGC CCTGGCAGCC ATTGGTCTGG CAGCCGTTAT GTCACAATCC GCTATGGCGG AGAACCTGAA GCTCGGTTTC CTGGTGAAGC AACCGGAAGA GCCGTGGTTC CAGACCGAAT GGAAGTTTGC CGATAAAGCC GGGAAGGATT TAGGGTTTGA GGTTATTAAG ATTGCCGTGC CGGATGGCGA AAAAACATTG AACGCGATCG ATAGCCTGGC CGCCAGTGGC GCAAAAGGTT TCGTTATTTG TACTCCGGAC CCGAAACTCG GCTCTGCCAT CGTGGCGAAA GCGCGTGGCT ACGATATGAA AGTCATTGCC GTGGATGACC AGTTTGTTAA CGCCAAAGGT AAGCCAATGG ATACCGTTCC TCTGGTGATG ATGGCGGCGA CCAAAATTGG CGAACGTCAG GGCCAGGAAC TGTATAAAGA GATGCAGAAA CGTGGCTGGG ATGTCAAAGA AAGCGCGGTG ATGGCGATTA CCTCCAACGA ACTGGATACG GCTCGTCGCC GTACTACGGG TTCTATGGAT GCGCTGAAAG CTGCAGGATT CCCGGAAAAA CAAATTTATC AGGTACCTAC CAAATCTAAC GACATCCCCG GGGCATTTGA CGCTGCCAAC TCAATGCTGG TTCAACATCC GGAAGTTAAA CATTGGCTGA TCGTCGGTAT GAACGACAGC ACCGTGCTGG GCGGCGTACG CGCGACGGAA GGTCAGGGCT TTAAAGCGGC CGATATCATC GGTATCGGTA TTAACGGTGT GGATGCGGTG AGCGAACTGT CTAAGGCACA GGCAACCGGC TTCTACGGTT CCCTGTTGCC AAGCCCGGAC GTACATGGTT ATAAGTCCAG CGAAATGCTT TACAACTGGG TAGCAAAAGG TGTTGAACCG ACCAAATTCA CCGAAGTTAC CGACGTGGTG CTGATCACGC GTGACAACTT TAAAGAAGAA CTGGAGAAAA AAGGTTTAGG CGGTAAGTAA
|
Protein sequence | MHKFTKALAA IGLAAVMSQS AMAENLKLGF LVKQPEEPWF QTEWKFADKA GKDLGFEVIK IAVPDGEKTL NAIDSLAASG AKGFVICTPD PKLGSAIVAK ARGYDMKVIA VDDQFVNAKG KPMDTVPLVM MAATKIGERQ GQELYKEMQK RGWDVKESAV MAITSNELDT ARRRTTGSMD ALKAAGFPEK QIYQVPTKSN DIPGAFDAAN SMLVQHPEVK HWLIVGMNDS TVLGGVRATE GQGFKAADII GIGINGVDAV SELSKAQATG FYGSLLPSPD VHGYKSSEML YNWVAKGVEP TKFTEVTDVV LITRDNFKEE LEKKGLGGK
|
| |
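The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the listed 990 bp, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names (feature_length, protein_length, strip_blocks, gc_percent) are hypothetical helpers introduced here.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values taken from the Start bp and End bp fields of this record.
gene_len = feature_length(2492585, 2493574)
print(gene_len, protein_length(gene_len))  # 990 329
```

Passing the full Gene sequence field to gc_percent gives a value that can be compared with the listed GC content; strip_blocks handles the spaces between the 10-base blocks.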