Gene EcDH1_3537 details


Gene Information

Locus tag: EcDH1_3537
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3803040
End bp: 3804542
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: L-arabinose isomerase
Protein accession: ACX41151
Protein GI: 260450729
COG category:
COG ID:
TIGRFAM ID:
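
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length (the gene sequence in this record does end in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(3803040, 3804542)
print(length_bp)                  # 1503
print(protein_length(length_bp))  # 500
```

Both values match the Gene Length and Protein Length fields listed in the record.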


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGATTT TTGATAATTA TGAAGTGTGG TTTGTCATTG GCAGCCAGCA TCTGTATGGC 
CCGGAAACCC TGCGTCAGGT CACCCAACAT GCCGAGCACG TCGTTAATGC GCTGAATACG
GAAGCGAAAC TGCCCTGCAA ACTGGTGTTG AAACCGCTGG GCACCACGCC GGATGAAATC
ACCGCTATTT GCCGCGACGC GAATTACGAC GATCGTTGCG CTGGTCTGGT GGTGTGGCTG
CACACCTTCT CCCCGGCCAA AATGTGGATC AACGGCCTGA CCATGCTCAA CAAACCGTTG
CTGCAATTCC ACACCCAGTT CAACGCGGCG CTGCCGTGGG ACAGTATCGA TATGGACTTT
ATGAACCTGA ACCAGACTGC ACATGGCGGT CGCGAGTTCG GCTTCATTGG CGCGCGTATG
CGTCAGCAAC ATGCCGTGGT TACCGGTCAC TGGCAGGATA AACAAGCCCA TGAGCGTATC
GGCTCCTGGA TGCGTCAGGC GGTCTCTAAA CAGGATACCC GTCATCTGAA AGTCTGCCGA
TTTGGCGATA ACATGCGTGA AGTGGCGGTC ACCGATGGCG ATAAAGTTGC CGCACAGATC
AAGTTCGGTT TCTCCGTCAA TACCTGGGCG GTTGGCGATC TGGTGCAGGT GGTGAACTCC
ATCAGCGACG GCGATGTTAA CGCGCTGGTC GATGAGTACG AAAGCTGCTA CACCATGACG
CCTGCCACAC AAATCCACGG CAAAAAACGA CAGAACGTGC TGGAAGCGGC GCGTATTGAG
CTGGGGATGA AGCGTTTCCT GGAACAAGGT GGCTTCCACG CGTTCACCAC CACCTTTGAA
GATTTGCACG GTCTGAAACA GCTTCCTGGT CTGGCCGTAC AGCGTCTGAT GCAGCAGGGT
TACGGCTTTG CGGGCGAAGG CGACTGGAAA ACTGCCGCCC TGCTTCGCAT CATGAAGGTG
ATGTCAACCG GTCTGCAGGG CGGCACCTCC TTTATGGAGG ACTACACCTA TCACTTCGAG
AAAGGTAATG ACCTGGTGCT CGGCTCCCAT ATGCTGGAAG TCTGCCCGTC GATCGCCGCA
GAAGAGAAAC CGATCCTCGA CGTTCAGCAT CTCGGTATTG GTGGTAAGGA CGATCCTGCC
CGCCTGATCT TCAATACCCA AACCGGCCCA GCGATTGTCG CCAGCTTGAT TGATCTCGGC
GATCGTTACC GTCTACTGGT TAACTGCATC GACACGGTGA AAACACCGCA CTCCCTGCCG
AAACTGCCGG TGGCGAATGC GCTGTGGAAA GCGCAACCGG ATCTGCCAAC TGCTTCCGAA
GCGTGGATCC TCGCTGGTGG CGCGCACCAT ACCGTCTTCA GCCATGCACT GAACCTCAAC
GATATGCGCC AATTCGCCGA GATGCACGAC ATTGAAATCA CGGTGATTGA TAACGACACA
CGCCTGCCAG CGTTTAAAGA CGCGCTGCGC TGGAACGAAG TGTATTACGG GTTTCGTCGC
TAA
 
Protein sequence
MTIFDNYEVW FVIGSQHLYG PETLRQVTQH AEHVVNALNT EAKLPCKLVL KPLGTTPDEI 
TAICRDANYD DRCAGLVVWL HTFSPAKMWI NGLTMLNKPL LQFHTQFNAA LPWDSIDMDF
MNLNQTAHGG REFGFIGARM RQQHAVVTGH WQDKQAHERI GSWMRQAVSK QDTRHLKVCR
FGDNMREVAV TDGDKVAAQI KFGFSVNTWA VGDLVQVVNS ISDGDVNALV DEYESCYTMT
PATQIHGKKR QNVLEAARIE LGMKRFLEQG GFHAFTTTFE DLHGLKQLPG LAVQRLMQQG
YGFAGEGDWK TAALLRIMKV MSTGLQGGTS FMEDYTYHFE KGNDLVLGSH MLEVCPSIAA
EEKPILDVQH LGIGGKDDPA RLIFNTQTGP AIVASLIDLG DRYRLLVNCI DTVKTPHSLP
KLPVANALWK AQPDLPTASE AWILAGGAHH TVFSHALNLN DMRQFAEMHD IEITVIDNDT
RLPAFKDALR WNEVYYGFRR
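
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool that produced this record. It builds the codon table from the conventional TCAG-ordered amino acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so that string is used here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACGATTT TTGATAATTA TGAAGTGTGG"))  # MTIFDNYEVW
```

The output matches the first ten residues of the protein sequence above. Applying the same function to the full gene sequence should yield the complete protein, since the CDS is unspliced and ends in the stop codon TAA.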