Gene EcSMS35_0064 details


Gene Information

Locus tag: EcSMS35_0064
Symbol: araA
ID: 6147409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 71898
End bp: 73400
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 55%
IMG OID: 641614965
Product: L-arabinose isomerase
Protein accession: YP_001742181
Protein GI: 170682791
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2160] L-arabinose isomerase
TIGRFAM ID:
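
The gene and protein lengths above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Number of residues for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(71898, 73400)
print(length, protein_length_aa(length))  # 1503 500
```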


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.203815
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.780498
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATTT TTGATAATTA TGAAGTGTGG TTTGTAATTG GCAGCCAGCA TCTTTACGGC 
CCGGAGACTC TGCGCCAGGT GACGCAACAT GCGGAACACG TTGTTAATGC ACTGAATACA
GAAGCGAAGT TGCCCTGCAA ACTGGTGCTG AAACCGCTGG GCACCACGCC GGATGAAATC
ACCGCTATTT GCCGCGACGC GAATTACGAC GATCGTTGCG CTGGTCTGGT GGTGTGGCTG
CACACCTTCT CCCCGGCCAA AATGTGGATC AACGGCCTGA CCATGCTCAA CAAACCGTTG
CTGCAATTCC ACACCCAGTT CAACGCGGCG CTGCCGTGGG ACAGCATCGA CATGGACTTT
ATGAACCTGA ACCAGACCGC GCATGGCGGT CGCGAGTTCG GCTTCATCGG CGCGCGTATG
CGTCAGCAAC ATGCCGTCGT TACCGGTCAC TGGCAGGATA AACAAGCACA TGAGCGTATC
GGCTCCTGGA TGCGTCAGGC GGTATCTAAA CAGGATACCC GTCATCTGAA AGTCTGCCGT
TTTGGCGATA ACATGCGTGA AGTGGCGGTC ACCGATGGCG ATAAAGTTGC CGCACAGATC
AAGTTCGGTT TCTCCGTCAA TACCTGGGCG GTTGGCGATC TGGTGCAGGT AGTGAACTCC
ATCAGCGATG GCGATGTTAA CGCGCTGGTC GATGAGTACG AAAGCTGCTA CACCATGACG
CCTGCGACAC AAATCCACGG CGAAAAACGA CAGAACGTGC TGGAAGCGGC ACGTATTGAG
CTGGGGATGA AGCGTTTCCT GGAACAAGGT GGCTTCCACG CGTTCACCAC CACCTTTGAA
GATTTGCACG GTCTGAAGCA GCTTCCTGGT CTGGCCGTAC AGCGTCTGAT GCAGCAGGGC
TACGGCTTTG CGGGCGAAGG CGACTGGAAA ACTGCCGCCC TGCTTCGCAT CATGAAGGTG
ATGTCAACCG GTCTGCAGGG CGGCACCTCC TTTATGGAGG ACTACACTTA CCACTTCGAA
AAAGGTAATG ACCTGGTGCT CGGCTCCCAT ATGCTGGAAG TCTGTCCGTC GATCGCCGTA
GAAGAGAAAC CGATCCTCGA CGTTCAACAC CTCGGTATTG GCGGTAAAGA CGATCCTGCC
CGCCTGATCT TCAACACTCA AACCGGTCCG GCCATTGTCG CCAGTCTGAT TGATCTCGGC
GATCGTTACC GTCTGCTGGT TAACTGCATC GACACTGTGA AAACACCGCA CTCCCTGCCG
AAACTGCCGG TGGCGAATGC GCTGTGGAAA GCGCAACCGG ATCTGCCAAC TGCTTCCGAA
GCGTGGATCC TCGCTGGTGG CGCGCACCAT ACCGTTTTCA GCCATGCGCT GAACCTCAAC
GATATGCGCC AGTTCGCCGA GATGCACGAC ATTGAAATCA CAGTGATTGA TAACGATACC
CGCCTGCCAG CGTTTAAAGA CGCGCTGCGC TGGAACGAAG TGTATTACGG ATTTCGTCGC
TAA
 
Protein sequence
MTIFDNYEVW FVIGSQHLYG PETLRQVTQH AEHVVNALNT EAKLPCKLVL KPLGTTPDEI 
TAICRDANYD DRCAGLVVWL HTFSPAKMWI NGLTMLNKPL LQFHTQFNAA LPWDSIDMDF
MNLNQTAHGG REFGFIGARM RQQHAVVTGH WQDKQAHERI GSWMRQAVSK QDTRHLKVCR
FGDNMREVAV TDGDKVAAQI KFGFSVNTWA VGDLVQVVNS ISDGDVNALV DEYESCYTMT
PATQIHGEKR QNVLEAARIE LGMKRFLEQG GFHAFTTTFE DLHGLKQLPG LAVQRLMQQG
YGFAGEGDWK TAALLRIMKV MSTGLQGGTS FMEDYTYHFE KGNDLVLGSH MLEVCPSIAV
EEKPILDVQH LGIGGKDDPA RLIFNTQTGP AIVASLIDLG DRYRLLVNCI DTVKTPHSLP
KLPVANALWK AQPDLPTASE AWILAGGAHH TVFSHALNLN DMRQFAEMHD IEITVIDNDT
RLPAFKDALR WNEVYYGFRR
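
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below shows how that can be checked on the first 30 bases. It relies on the fact that table 11 uses the same amino-acid assignments as the standard code and differs in which codons may act as start codons; this gene starts with ATG, so no start-codon handling is needed here. The names `CODON_TABLE` and `translate` are illustrative, not taken from the record.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base varies
# fastest). These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    Alternative start codons are not special-cased in this sketch.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGACGATTT TTGATAATTA TGAAGTGTGG"
print(translate(first_blocks))  # MTIFDNYEVW
```

The output matches the first block of the protein sequence listed above.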