Gene Information |
Locus tag | ECH74115_0067 |
Symbol | araA |
ID | 6967375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 71662 |
End bp | 73164 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384147 |
Product | L-arabinose isomerase |
Protein accession | YP_002268670 |
Protein GI | 209396986 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
|
|
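The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes 1-based, inclusive coordinates and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(71662, 73164)
print(length_bp)                  # 1503
print(protein_length(length_bp))  # 500
```

Both results agree with the Gene Length (1503 bp) and Protein Length (500 aa) fields reported in the record.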
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATTT TTGATAATTA TGAAGTGTGG TTTGTAATTG GCAGCCAGCA TCTGTATGGC CCGGAAACCC TGCGTCAGGT CACCCAACAT GCCGAGCACG TTGTGAAAGC GCTGAATACG GAAGCGAAAC TGCCCTGCAA ACTGGTACTG AAACCGCTGG GCACCACGCC GGATGAAATC ACCGCTATTT GCCGTGATGC TAATTACGAC GATCGTTGCG CTGGTCTGGT GGTGTGGCTA CACACCTTCT CCCCGGCCAA AATGTGGATC AACGGCCTGA CCATGCTCAA CAAACCGTTG CTGCAATTCC ACACCCAGTT CAACGCGGCG CTGCCGTGGG ACAGCATCGA TATGGACTTT ATGAACCTGA ACCAGACCGC GCACGGCGGT CGCGAGTTCG GCTTCATTGG CGCGCGTATG CGTCAGCAAC ATGCCGTGGT TACCGGTCAC TGGCAGGATA AACAAGCCCA TGAGCGTATC GGCTCCTGGA TGCGTCAGGC GGTATCTAAA CAGGATACCC GTCATCTGAA AGTCTGCCGA TTTGGCGATA ACATGCGTGA AGTGGCGGTC ACCGATGGCG ATAAAGTTGC CGCACAGATC AAGTTCGGTT TCTCCGTCAA TACCTGGGCG GTTGGCGATC TGGTGCAGGT GGTGAACTCC ATCAGCGACG GCGATGTTAA CGCGCTGGTC GATGAGTACG AAAGCTGCTA CACCATGACG CCTGCCACAC AAATCCACGG CGAAAAACGA CAGAACGTGC TGGAAGCGGC GCGTATTGAG CTGGGGATGA AACGTTTCCT GGAACAAGGT GGCTTCCACG CGTTCACCAC CACCTTTGAA GATTTGCACG GTCTGAAACA GCTTCCTGGT CTGGCCGTAC AGCGTCTGAT GCAGCAGGGT TACGGCTTTG CGGGCGAAGG CGACTGGAAA ACTGCCGCCC TGCTTCGCAT CATGAAGGTG ATGTCAACCG GTCTGCAGGG CGGCACCTCC TTTATGGAGG ACTACACCTA TCACTTCGAG AAAGGCAATG ACCTGGTGCT CGGCTCCCAT ATGCTGGAAG TCTGCCCGTC GATCGCCGCA GAAGAGAAAC CGATCCTCGA CGTTCAGCAT CTCGGTATTG GTGGTAAGGA CGATCCTGCC CGCCTGATCT TCAATACCCA AACCGGTCCA GCGATTGTCG CCAGCTTGAT TGATCTCGGC GATCGTTACC GTCTACTGGT TAACTGCATC GACACGGTGA AAACACCGCA CTCCCTGCCG AAACTGCCGG TGGCGAATGC GCTGTGGAAA GCGCAACCGG ATCTGCCAAC TGCTTCCGAA GCGTGGATCC TCGCTGGTGG CGCGCACCAT ACCGTCTTCA GCCATGCGCT GAACCTCAAC GATATGCGCC AGTTCGCCGA GATGCACGAC ATTGAAATCA CGGTGATTGA TAACGACACC CGCCTGCCAG CGTTTAAAGA CGCGCTGCGC TGGAACGAAG TGTATTACGG GTTTCGCCGC TAA
|
Protein sequence | MTIFDNYEVW FVIGSQHLYG PETLRQVTQH AEHVVKALNT EAKLPCKLVL KPLGTTPDEI TAICRDANYD DRCAGLVVWL HTFSPAKMWI NGLTMLNKPL LQFHTQFNAA LPWDSIDMDF MNLNQTAHGG REFGFIGARM RQQHAVVTGH WQDKQAHERI GSWMRQAVSK QDTRHLKVCR FGDNMREVAV TDGDKVAAQI KFGFSVNTWA VGDLVQVVNS ISDGDVNALV DEYESCYTMT PATQIHGEKR QNVLEAARIE LGMKRFLEQG GFHAFTTTFE DLHGLKQLPG LAVQRLMQQG YGFAGEGDWK TAALLRIMKV MSTGLQGGTS FMEDYTYHFE KGNDLVLGSH MLEVCPSIAA EEKPILDVQH LGIGGKDDPA RLIFNTQTGP AIVASLIDLG DRYRLLVNCI DTVKTPHSLP KLPVANALWK AQPDLPTASE AWILAGGAHH TVFSHALNLN DMRQFAEMHD IEITVIDNDT RLPAFKDALR WNEVYYGFRR
|
| |
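The sequence fields can be cross-checked against the GC content and protein sequence reported above. The Python sketch below is one way to do that, under stated assumptions: the gene sequence is pasted as shown (10-base blocks separated by spaces) and is already in coding orientation, which is consistent with it starting with ATG and ending with TAA even though the strand is "-". The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares; table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the gene sequence, as formatted in the record.
print(translate("ATGACGATTT TTGATAAT"))  # MTIFDN
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence field would confirm that the two fields agree; `gc_content` on the full sequence can be compared with the reported 56%.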