Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3595 |
Symbol | |
ID | 6067404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3931244 |
End bp | 3932746 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603013 |
Product | L-arabinose isomerase |
Protein accession | YP_001726536 |
Protein GI | 170021582 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000176548 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGACGATTT TTGATAATTA TGAAGTGTGG TTTGTCATTG GCAGCCAGCA TCTGTATGGC CCGGAAACCC TGCGTCAGGT CACCCAACAT GCCGAGCACG TTGTTAATGC GCTGAATACG GAAGCGAAAC TGCCCTGCAA ACTGGTGTTG AAACCGCTGG GCACCACGCC GGATGAAATC ACCGCTATTT GCCGCGACGC GAATTACGAC GATCGTTGCG CTGGTCTGGT GGTGTGGCTG CACACCTTCT CCCCGGCCAA AATGTGGATC AACGGCCTGA CCATGCTCAA CAAACCGTTG CTGCAATTCC ACACCCAGTT CAACGCGGCG CTGCCGTGGG ACAGTATCGA TATGGACTTT ATGAACCTGA ACCAGACTGC ACATGGCGGT CGTGAGTTCG GCTTCATTGG CGCGCGTATG CGTCAGCAAC ATGCCGTCGT TACCGGTCAC TGGCAGGATA AACAAGCCCA TGAGCGTATC GGCTCCTGGA TGCGTCAGGC GGTTTCTAAA CAGGATACCC GTCATCTGAA AGTCTGCCGT TTTGGCGATA ACATGCGTGA AGTGGCGGTC ACCGATGGTG ATAAAGTTGC CGCACAGATC AAGTTCGGTT TCTCCGTCAA TACCTGGGCG GTTGGCGATC TGGTGCAGGT GGTGAACTCC ATCAGCGACG GCGATGTTAA CGCGCTGGTC GATGAGTACG AAAGCTGCTA CACCATGACG CCTGCAACAC AAATCCACGG CGAAAAACGA CAGAACGTGC TGGAAGCGGC GCGTATTGAG CTGGGGATGA AGCGTTTCCT GGAACAAGGT GGCTTCCACG CGTTCACCAC CACCTTTGAA GATTTGCACG GTCTGAAACA GCTTCCAGGT CTGGCCGTAC AGCGTCTGAT GCAGCAGGGT TACGGCTTTG CGGGCGAAGG CGACTGGAAA ACCGCCGCCC TGCTTCGCAT CATGAAGGTG ATGTCAACCG GTCTGCAGGG CGGCACCTCC TTTATGGAGG ACTACACCTA TCACTTCGAG AAAGGTAATG ACCTGGTGCT CGGCTCCCAT ATGCTGGAAG TCTGCCCGTC GATTGCCGTA GAAGAGAAAC CGATCCTCGA CGTTCAGCAT CTCGGTATTG GTGGTAAGGA CGATCCTGCC CGACTGATCT TCAATACCCA AACCGGTCCA GCGATTGTCG CCAGCCTGAT TGATCTCGGC GATCGTTACC GTCTGCTGGT TAACTGTATC GACACGGTGA AAACACCGCA CTCCCTGCCG AAACTGCCGG TGGCGAATGC GCTGTGGAAA GCGCAACCGG ATCTGCCAAC TGCTTCCGAA GCGTGGATCC TCGCTGGTGG CGCGCACCAT ACCGTCTTCA GCCATGCGCT GAACCTCAAC GATATGCGCC AGTTCGCCGA GATGCACGAC ATTGAAATCA CGGTGATTGA TAACGACACC CGCCTGCCAG CGTTTAAAGA CGCACTGCGC TGGAACGAAG TGTATTACGG ATTTCGTCGC TAA
|
Protein sequence | MTIFDNYEVW FVIGSQHLYG PETLRQVTQH AEHVVNALNT EAKLPCKLVL KPLGTTPDEI TAICRDANYD DRCAGLVVWL HTFSPAKMWI NGLTMLNKPL LQFHTQFNAA LPWDSIDMDF MNLNQTAHGG REFGFIGARM RQQHAVVTGH WQDKQAHERI GSWMRQAVSK QDTRHLKVCR FGDNMREVAV TDGDKVAAQI KFGFSVNTWA VGDLVQVVNS ISDGDVNALV DEYESCYTMT PATQIHGEKR QNVLEAARIE LGMKRFLEQG GFHAFTTTFE DLHGLKQLPG LAVQRLMQQG YGFAGEGDWK TAALLRIMKV MSTGLQGGTS FMEDYTYHFE KGNDLVLGSH MLEVCPSIAV EEKPILDVQH LGIGGKDDPA RLIFNTQTGP AIVASLIDLG DRYRLLVNCI DTVKTPHSLP KLPVANALWK AQPDLPTASE AWILAGGAHH TVFSHALNLN DMRQFAEMHD IEITVIDNDT RLPAFKDALR WNEVYYGFRR
|
| |