Gene ECH74115_0067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0067 
SymbolaraA 
ID6967375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp71662 
End bp73164 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content56% 
IMG OID643384147 
ProductL-arabinose isomerase 
Protein accessionYP_002268670 
Protein GI209396986 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2160] L-arabinose isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATTT TTGATAATTA TGAAGTGTGG TTTGTAATTG GCAGCCAGCA TCTGTATGGC 
CCGGAAACCC TGCGTCAGGT CACCCAACAT GCCGAGCACG TTGTGAAAGC GCTGAATACG
GAAGCGAAAC TGCCCTGCAA ACTGGTACTG AAACCGCTGG GCACCACGCC GGATGAAATC
ACCGCTATTT GCCGTGATGC TAATTACGAC GATCGTTGCG CTGGTCTGGT GGTGTGGCTA
CACACCTTCT CCCCGGCCAA AATGTGGATC AACGGCCTGA CCATGCTCAA CAAACCGTTG
CTGCAATTCC ACACCCAGTT CAACGCGGCG CTGCCGTGGG ACAGCATCGA TATGGACTTT
ATGAACCTGA ACCAGACCGC GCACGGCGGT CGCGAGTTCG GCTTCATTGG CGCGCGTATG
CGTCAGCAAC ATGCCGTGGT TACCGGTCAC TGGCAGGATA AACAAGCCCA TGAGCGTATC
GGCTCCTGGA TGCGTCAGGC GGTATCTAAA CAGGATACCC GTCATCTGAA AGTCTGCCGA
TTTGGCGATA ACATGCGTGA AGTGGCGGTC ACCGATGGCG ATAAAGTTGC CGCACAGATC
AAGTTCGGTT TCTCCGTCAA TACCTGGGCG GTTGGCGATC TGGTGCAGGT GGTGAACTCC
ATCAGCGACG GCGATGTTAA CGCGCTGGTC GATGAGTACG AAAGCTGCTA CACCATGACG
CCTGCCACAC AAATCCACGG CGAAAAACGA CAGAACGTGC TGGAAGCGGC GCGTATTGAG
CTGGGGATGA AACGTTTCCT GGAACAAGGT GGCTTCCACG CGTTCACCAC CACCTTTGAA
GATTTGCACG GTCTGAAACA GCTTCCTGGT CTGGCCGTAC AGCGTCTGAT GCAGCAGGGT
TACGGCTTTG CGGGCGAAGG CGACTGGAAA ACTGCCGCCC TGCTTCGCAT CATGAAGGTG
ATGTCAACCG GTCTGCAGGG CGGCACCTCC TTTATGGAGG ACTACACCTA TCACTTCGAG
AAAGGCAATG ACCTGGTGCT CGGCTCCCAT ATGCTGGAAG TCTGCCCGTC GATCGCCGCA
GAAGAGAAAC CGATCCTCGA CGTTCAGCAT CTCGGTATTG GTGGTAAGGA CGATCCTGCC
CGCCTGATCT TCAATACCCA AACCGGTCCA GCGATTGTCG CCAGCTTGAT TGATCTCGGC
GATCGTTACC GTCTACTGGT TAACTGCATC GACACGGTGA AAACACCGCA CTCCCTGCCG
AAACTGCCGG TGGCGAATGC GCTGTGGAAA GCGCAACCGG ATCTGCCAAC TGCTTCCGAA
GCGTGGATCC TCGCTGGTGG CGCGCACCAT ACCGTCTTCA GCCATGCGCT GAACCTCAAC
GATATGCGCC AGTTCGCCGA GATGCACGAC ATTGAAATCA CGGTGATTGA TAACGACACC
CGCCTGCCAG CGTTTAAAGA CGCGCTGCGC TGGAACGAAG TGTATTACGG GTTTCGCCGC
TAA
 
Protein sequence
MTIFDNYEVW FVIGSQHLYG PETLRQVTQH AEHVVKALNT EAKLPCKLVL KPLGTTPDEI 
TAICRDANYD DRCAGLVVWL HTFSPAKMWI NGLTMLNKPL LQFHTQFNAA LPWDSIDMDF
MNLNQTAHGG REFGFIGARM RQQHAVVTGH WQDKQAHERI GSWMRQAVSK QDTRHLKVCR
FGDNMREVAV TDGDKVAAQI KFGFSVNTWA VGDLVQVVNS ISDGDVNALV DEYESCYTMT
PATQIHGEKR QNVLEAARIE LGMKRFLEQG GFHAFTTTFE DLHGLKQLPG LAVQRLMQQG
YGFAGEGDWK TAALLRIMKV MSTGLQGGTS FMEDYTYHFE KGNDLVLGSH MLEVCPSIAA
EEKPILDVQH LGIGGKDDPA RLIFNTQTGP AIVASLIDLG DRYRLLVNCI DTVKTPHSLP
KLPVANALWK AQPDLPTASE AWILAGGAHH TVFSHALNLN DMRQFAEMHD IEITVIDNDT
RLPAFKDALR WNEVYYGFRR