Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0064 |
Symbol | araA |
ID | 6147409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 71898 |
End bp | 73400 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641614965 |
Product | L-arabinose isomerase |
Protein accession | YP_001742181 |
Protein GI | 170682791 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.203815 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.780498 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATTT TTGATAATTA TGAAGTGTGG TTTGTAATTG GCAGCCAGCA TCTTTACGGC CCGGAGACTC TGCGCCAGGT GACGCAACAT GCGGAACACG TTGTTAATGC ACTGAATACA GAAGCGAAGT TGCCCTGCAA ACTGGTGCTG AAACCGCTGG GCACCACGCC GGATGAAATC ACCGCTATTT GCCGCGACGC GAATTACGAC GATCGTTGCG CTGGTCTGGT GGTGTGGCTG CACACCTTCT CCCCGGCCAA AATGTGGATC AACGGCCTGA CCATGCTCAA CAAACCGTTG CTGCAATTCC ACACCCAGTT CAACGCGGCG CTGCCGTGGG ACAGCATCGA CATGGACTTT ATGAACCTGA ACCAGACCGC GCATGGCGGT CGCGAGTTCG GCTTCATCGG CGCGCGTATG CGTCAGCAAC ATGCCGTCGT TACCGGTCAC TGGCAGGATA AACAAGCACA TGAGCGTATC GGCTCCTGGA TGCGTCAGGC GGTATCTAAA CAGGATACCC GTCATCTGAA AGTCTGCCGT TTTGGCGATA ACATGCGTGA AGTGGCGGTC ACCGATGGCG ATAAAGTTGC CGCACAGATC AAGTTCGGTT TCTCCGTCAA TACCTGGGCG GTTGGCGATC TGGTGCAGGT AGTGAACTCC ATCAGCGATG GCGATGTTAA CGCGCTGGTC GATGAGTACG AAAGCTGCTA CACCATGACG CCTGCGACAC AAATCCACGG CGAAAAACGA CAGAACGTGC TGGAAGCGGC ACGTATTGAG CTGGGGATGA AGCGTTTCCT GGAACAAGGT GGCTTCCACG CGTTCACCAC CACCTTTGAA GATTTGCACG GTCTGAAGCA GCTTCCTGGT CTGGCCGTAC AGCGTCTGAT GCAGCAGGGC TACGGCTTTG CGGGCGAAGG CGACTGGAAA ACTGCCGCCC TGCTTCGCAT CATGAAGGTG ATGTCAACCG GTCTGCAGGG CGGCACCTCC TTTATGGAGG ACTACACTTA CCACTTCGAA AAAGGTAATG ACCTGGTGCT CGGCTCCCAT ATGCTGGAAG TCTGTCCGTC GATCGCCGTA GAAGAGAAAC CGATCCTCGA CGTTCAACAC CTCGGTATTG GCGGTAAAGA CGATCCTGCC CGCCTGATCT TCAACACTCA AACCGGTCCG GCCATTGTCG CCAGTCTGAT TGATCTCGGC GATCGTTACC GTCTGCTGGT TAACTGCATC GACACTGTGA AAACACCGCA CTCCCTGCCG AAACTGCCGG TGGCGAATGC GCTGTGGAAA GCGCAACCGG ATCTGCCAAC TGCTTCCGAA GCGTGGATCC TCGCTGGTGG CGCGCACCAT ACCGTTTTCA GCCATGCGCT GAACCTCAAC GATATGCGCC AGTTCGCCGA GATGCACGAC ATTGAAATCA CAGTGATTGA TAACGATACC CGCCTGCCAG CGTTTAAAGA CGCGCTGCGC TGGAACGAAG TGTATTACGG ATTTCGTCGC TAA
|
Protein sequence | MTIFDNYEVW FVIGSQHLYG PETLRQVTQH AEHVVNALNT EAKLPCKLVL KPLGTTPDEI TAICRDANYD DRCAGLVVWL HTFSPAKMWI NGLTMLNKPL LQFHTQFNAA LPWDSIDMDF MNLNQTAHGG REFGFIGARM RQQHAVVTGH WQDKQAHERI GSWMRQAVSK QDTRHLKVCR FGDNMREVAV TDGDKVAAQI KFGFSVNTWA VGDLVQVVNS ISDGDVNALV DEYESCYTMT PATQIHGEKR QNVLEAARIE LGMKRFLEQG GFHAFTTTFE DLHGLKQLPG LAVQRLMQQG YGFAGEGDWK TAALLRIMKV MSTGLQGGTS FMEDYTYHFE KGNDLVLGSH MLEVCPSIAV EEKPILDVQH LGIGGKDDPA RLIFNTQTGP AIVASLIDLG DRYRLLVNCI DTVKTPHSLP KLPVANALWK AQPDLPTASE AWILAGGAHH TVFSHALNLN DMRQFAEMHD IEITVIDNDT RLPAFKDALR WNEVYYGFRR
|
| |