Gene EcolC_4114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4114 
Symbol 
ID6065949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4539318 
End bp4540577 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content56% 
IMG OID641603536 
ProductL-rhamnose isomerase 
Protein accessionYP_001727039 
Protein GI170022085 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4806] L-rhamnose isomerase 
TIGRFAM ID[TIGR01748] L-rhamnose isomerase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.937168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTC AACTGGAACA GGCCTGGGAG CTAGCGAAAC AGCGTTTCGC GGCGGTGGGG 
ATTGATGTCG AGGAGGCGCT GCGCCAACTT GATCGTTTAC CCGTTTCAAT GCACTGCTGG
CAGGGCGATG ATGTTTCCGG TTTTGAAAAC TCGGAAGGTT CGCTGACCGG GGGGATTCAG
GCCACAGGCA ATTATCCGGG CAAAGCGCGT AATGCCAGTG AGCTACGTGC CGATCTGGAA
CAGGCTATGC GGCTGATTCC GGGGCCGAAA CGGCTTAATT TACATGCCAT CTATCTGGAA
TCAGATACGC CAGTCTCGCG CGACCAGATC AAACCAGAGC ACTTCAAAAA CTGGGTTGAA
TGGGCGAAAG CCAATCAGCT CGGTCTGGAT TTTAACCCCT CCTGCTTTTC GCATCCGCTA
AGCGCCGATG GCTTTACGCT TTCCCATGCC GACGACAGCA TTCGCCAGTT CTGGATTGAT
CACTGCAAAG CCAGCCGTCG CGTTTCGGCC TATTTTGGCG AGCAACTCGG CACACCATCG
GTGATGAACA TCTGGATCCC GGATGGTATG AAAGATATCA CCGTTGACCG TCTCGCCCCG
CGTCAGCGTC TGCTGGCAGC ACTGGATGAG GTGATCAGCG AGAAGCTAAA CCCTGCGCAC
CATATCGACG CCGTTGAGAG CAAATTGTTT GGCATTGGCG CAGAGAGCTA CACGGTTGGC
TCCAATGAGT TTTACATGGG GTATGCCACC AGCCGCCAGA CTGCGCTGTG CCTGGACGCC
GGGCACTTCC ACCCGACTGA AGTGATTTCC GACAAGATTT CCGCCGCCAT GCTGTATGTG
CCGCAGTTGC TGCTGCACGT CAGCCGTCCG GTTCGCTGGG ACAGCGATCA CGTAGTGCTG
CTGGATGATG AAACCCAGGC AATTGCCAGT GAGATTGTGC GTCACGATCT GTTTGACCGG
GTGCATATCG GCCTTGACTT CTTCGATGCC TCTATCAACC GCATTGCCGC GTGGGTCATT
GGTACACGCA ATATGAAAAA AGCCCTGCTG CGTGCGTTGC TGGAACCTAC CGCTGAGCTG
CGCAAGCTGG AAGCGGCGGG CGATTACACT GCGCGTCTGG CACTGCTGGA AGAGCAGAAA
TCGTTGCCGT GGCAGGCGGT CTGGGAAATG TATTGCCAAC GTCACGATAC GCCAGCAGGT
AGCGAATGGC TGGAGAGCGT GCGGGCTTAT GAGAAAGAAA TTTTGAGTCG CCGCGGGTAA
 
Protein sequence
MTTQLEQAWE LAKQRFAAVG IDVEEALRQL DRLPVSMHCW QGDDVSGFEN SEGSLTGGIQ 
ATGNYPGKAR NASELRADLE QAMRLIPGPK RLNLHAIYLE SDTPVSRDQI KPEHFKNWVE
WAKANQLGLD FNPSCFSHPL SADGFTLSHA DDSIRQFWID HCKASRRVSA YFGEQLGTPS
VMNIWIPDGM KDITVDRLAP RQRLLAALDE VISEKLNPAH HIDAVESKLF GIGAESYTVG
SNEFYMGYAT SRQTALCLDA GHFHPTEVIS DKISAAMLYV PQLLLHVSRP VRWDSDHVVL
LDDETQAIAS EIVRHDLFDR VHIGLDFFDA SINRIAAWVI GTRNMKKALL RALLEPTAEL
RKLEAAGDYT ARLALLEEQK SLPWQAVWEM YCQRHDTPAG SEWLESVRAY EKEILSRRG