Gene EcE24377A_0064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0064 
SymbolaraA 
ID5586876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp68525 
End bp70027 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content55% 
IMG OID640923795 
ProductL-arabinose isomerase 
Protein accessionYP_001461232 
Protein GI157155370 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2160] L-arabinose isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0849875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTT TTGATAATTA TGAAGTGTGG TTTGTCATTG GCAGCCAGCA TCTGTATGGC 
CCGGAAACCC TGCGTCAGGT CACCCAACAT GCCGAGCACG TTGTTAATGC GCTGAATACG
GAAGCGAAAC TGCCCTGCAA ACTAGTGTTG AAACCGCTGG GCACCACGCC GGATGAAATC
ACCGCTATTT GCCGCGACGC GAATTACGAC GATCGTTGCG CTGGTCTGGT GGTGTGGCTG
CACACTTTCT CCCCGGCCAA AATGTGGATC AACGGCCTGA CCATGCTCAA CAAACCGTTG
CTGCAATTCC ACACCCAGTT TAACGCGGCG CTGCCGTGGG ACAGTATCGA TATGGACTTT
ATGAACCTGA ACCAGACTGC ACATGGCGGT CGCGAGTTCG GCTTCATTGG CGCGCGTATG
CGTCAGCAAC ATGCCGTGGT TACCGGTCAC TGGCAGGATA AACAAGCCCA TGAGCGTATC
GGCTCCTGGA TGCGCCAGGC GGTCTCTAAA CAGGATACCC GTCATCTGAA AGTCTGCCGT
TTTGGCGATA ACATGCGTGA AGTAGCGGTC ACCGATGGTG ATAAAGTTGC CGCACAGATC
AAGTTCGGTT TCTCCGTCAA TACCTGGGCG GTTGGCGATC TGGTGCAGGT GGTGAACTCC
ATCAGCGACG GCGATGTTAA CGCGCTGGTC GATGAGTACG AAAGCTGCTA CACCATGACG
CCTGCCACAC AAATCCACGG CGAAAAACGA CAGAACGTGC TGGAAGCGGC GCGTATTGAG
CTGGGGATGA AGCGTTTCCT GGAACAAGGT GGCTTCCACG CGTTCACCAC CACCTTTGAA
GATTTGCACG GTCTGAAACA GCTTCCAGGT CTGGCCGTAC AGCGTCTGAT GCAGCAGGGT
TACGGCTTTG CGGGCGAAGG CGACTGGAAA ACCGCCGCCC TGCTTCGCAT CATGAAGGTG
ATGTCAACCG GTCTGCAGGG CGGCACCTCC TTTATGGAGG ACTACACCTA TCACTTCGAG
AAAGGTAATG ACCTGGTGCT CGGCTCCCAT ATGCTGGAAG TCTGCCCGTC GATTGCCGTA
GAAGAGAAAC CGATCCTCGA CGTTCAGCAT CTCGGTATTG GTGGTAAGGA CGATCCTGCC
CGACTGATCT TCAATACCCA AACCGGTCCA GCGATTGTCG CCAGCCTGAT TGATCTCGGC
GATCGTTACC GTCTGCTGGT TAACTGTATC GACACGGTGA AAACACCGCA CTCCCTGCCG
AAACTGCCGG TGGCGAATGC GCTGTGGAAA GCGCAACCGG ATCTGCCAAC TGCTTCCGAA
GCGTGGATCC TTGCTGGTGG CGCGCACCAT ACCGTCTTCA GCCATGCGCT GAACCTCAAC
GATATGCGCC AGTTCGCCGA GATGCACGAC ATTGAAATCA CGGTGATTGA TAACGACACC
CGCCTGCCAG CGTTTAAAGA CGCACTGCGC TGGAACGAAG TGTATTACGG ATTTCGTCGC
TAA
 
Protein sequence
MTIFDNYEVW FVIGSQHLYG PETLRQVTQH AEHVVNALNT EAKLPCKLVL KPLGTTPDEI 
TAICRDANYD DRCAGLVVWL HTFSPAKMWI NGLTMLNKPL LQFHTQFNAA LPWDSIDMDF
MNLNQTAHGG REFGFIGARM RQQHAVVTGH WQDKQAHERI GSWMRQAVSK QDTRHLKVCR
FGDNMREVAV TDGDKVAAQI KFGFSVNTWA VGDLVQVVNS ISDGDVNALV DEYESCYTMT
PATQIHGEKR QNVLEAARIE LGMKRFLEQG GFHAFTTTFE DLHGLKQLPG LAVQRLMQQG
YGFAGEGDWK TAALLRIMKV MSTGLQGGTS FMEDYTYHFE KGNDLVLGSH MLEVCPSIAV
EEKPILDVQH LGIGGKDDPA RLIFNTQTGP AIVASLIDLG DRYRLLVNCI DTVKTPHSLP
KLPVANALWK AQPDLPTASE AWILAGGAHH TVFSHALNLN DMRQFAEMHD IEITVIDNDT
RLPAFKDALR WNEVYYGFRR