Gene EcDH1_4081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4081 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4418488 
End bp4419747 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content56% 
IMG OID 
ProductL-rhamnose isomerase 
Protein accessionACX41681 
Protein GI260451259 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACTC AACTGGAACA GGCCTGGGAG CTAGCGAAAC AGCGTTTCGC GGCGGTGGGG 
ATTGATGTCG AGGAGGCGCT GCGCCAACTT GATCGTTTAC CCGTTTCAAT GCACTGCTGG
CAGGGCGATG ATGTTTCCGG TTTTGAAAAC CCGGAAGGTT CGCTGACCGG GGGGATTCAG
GCCACAGGCA ATTATCCGGG CAAAGCGCGT AATGCCAGTG AGCTACGTGC CGATCTGGAA
CAGGCTATGC GGCTGATTCC GGGGCCGAAA CGGCTTAATT TACATGCCAT CTATCTGGAA
TCAGATACGC CAGTCTCGCG CGACCAGATC AAACCAGAGC ACTTCAAAAA CTGGGTTGAA
TGGGCGAAAG CCAATCAGCT CGGTCTGGAT TTTAACCCCT CCTGCTTTTC GCATCCGCTA
AGCGCCGATG GCTTTACGCT TTCCCATGCC GACGACAGCA TTCGCCAGTT CTGGATTGAT
CACTGCAAAG CCAGCCGTCG CGTTTCGGCC TATTTTGGCG AGCAACTCGG CACACCATCG
GTGATGAACA TCTGGATCCC GGATGGTATG AAAGATATCA CCGTTGACCG TCTCGCCCCG
CGTCAGCGTC TGCTGGCAGC ACTGGATGAG GTGATCAGCG AGAAGCTAAA CCCTGCGCAC
CATATCGACG CCGTTGAGAG CAAATTGTTT GGCATTGGCG CAGAGAGCTA CACGGTTGGC
TCCAATGAGT TTTACATGGG GTATGCCACC AGCCGCCAGA CTGCGCTGTG CCTGGACGCC
GGGCACTTCC ACCCGACTGA AGTGATTTCC GACAAGATTT CCGCCGCCAT GCTGTATGTG
CCGCAGTTGC TGCTGCACGT CAGCCGTCCG GTTCGCTGGG ACAGCGATCA CGTAGTGCTG
CTGGATGATG AAACCCAGGC AATTGCCAGT GAGATTGTGC GTCACGATCT GTTTGACCGG
GTGCATATCG GCCTTGACTT CTTCGATGCC TCTATCAACC GCATTGCCGC GTGGGTCATT
GGTACACGCA ATATGAAAAA AGCCCTGCTG CGTGCGTTGC TGGAACCTAC CGCTGAGCTG
CGCAAGCTGG AAGCGGCGGG CGATTACACT GCGCGTCTGG CACTGCTGGA AGAGCAGAAA
TCGTTGCCGT GGCAGGCGGT CTGGGAAATG TATTGCCAAC GTCACGATAC GCCAGCAGGT
AGCGAATGGC TGGAGAGCGT GCGGGCTTAT GAGAAAGAAA TTTTGAGTCG CCGCGGGTAA
 
Protein sequence
MTTQLEQAWE LAKQRFAAVG IDVEEALRQL DRLPVSMHCW QGDDVSGFEN PEGSLTGGIQ 
ATGNYPGKAR NASELRADLE QAMRLIPGPK RLNLHAIYLE SDTPVSRDQI KPEHFKNWVE
WAKANQLGLD FNPSCFSHPL SADGFTLSHA DDSIRQFWID HCKASRRVSA YFGEQLGTPS
VMNIWIPDGM KDITVDRLAP RQRLLAALDE VISEKLNPAH HIDAVESKLF GIGAESYTVG
SNEFYMGYAT SRQTALCLDA GHFHPTEVIS DKISAAMLYV PQLLLHVSRP VRWDSDHVVL
LDDETQAIAS EIVRHDLFDR VHIGLDFFDA SINRIAAWVI GTRNMKKALL RALLEPTAEL
RKLEAAGDYT ARLALLEEQK SLPWQAVWEM YCQRHDTPAG SEWLESVRAY EKEILSRRG