Gene ECH74115_5601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5601 
Symbol 
ID6972069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5240020 
End bp5241540 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content57% 
IMG OID643389237 
Productribose import ATP-binding protein RbsA 
Protein accessionYP_002273634 
Protein GI209398733 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGCA CGCCAGTTCT GGAGATGCGC AATATTGCCA AAGCCTTCGG CAAATTTTAT 
GCACTCAAAG GGGTGGATTT GACGGTCTAC CCTGGCGAGA TCCATGCCCT GATGGGGGAA
AACGGCGCGG GAAAAAGCAC GCTGATGAAG GTGCTGGCGG GGGCGTATAC CGCCACCAGC
GGCGAGATTC TCATAGACGG CAAGCCCTTT CACATTCGCA CGCCAAAAGA TGCCTTAAGC
GCCGGTATTA CGCTGATTTA TCAGGAGATG CAGCTGGCAC CGAATCTGTC GGTGGCAGAA
AATATTTTTC TCGGCAGCGA GCTTTCCCAC GGCGGGCTGG TGCAGCGTAA AGAGATGCTA
GTGCAGGCGC AAAAAGTGAT CGACCGCCTC GGCGCGCAGT TTAACGCCAG CGATAAGGTC
ATGACGCTGA CCATTGCCGA GCAACAGCAG GTCGAAATCG CCCGCGCACT ACATCGCAAC
AGCCGCATTC TGGTGATGGA CGAACCCACC GCTGCCCTCT CCTCCCGCGA AACTCACCGC
CTGTTTGAAC TGATTATGCG GCTGCGCGAT GAAGGGATGG CGATTATCTA CATTAGCCAC
CGCATGGCGG AAGTGTATGA GCTTTCCGAT CGCGTCAGCG TGCTACGCGA CGGGCAATAC
GTTGGCAGCC TGACCCGCGA TAACCTCAAT GCCGGGGAGC TGGTGCGGAT GATGGTCGGC
AGGCCACTGA GCGATCTGTT CAATAAAGAG CGCGATATCC CGCTCGGTAA AGCCCGCCTG
AATGTTCACC ACCTGACCGA CGGCGGCAAA GTCCAGCCGA GTAGCCTGCT GGTGCGTTCC
GGCGAAATTG TTGGCCTCGC CGGACTGGTG GGTGCCGGAC GTTCCGAACT GGCGCAGTTG
ATCTTCGGCG TGCGGAAAGC GACAGGCGGA ATGATTGAAG TCGATGGTGA ACCGGTGGTG
ATCCACTCCC CGCGCGAAGC CATCGATCTT GGCATTGGTT TTCTCACCGA AAACCGCAAA
GAACAAGGCT TATTCCTTGA AATGGCAGCA GCCGAAAACA TCACCATGGC AACCCTGGAG
CGCGATGCCC GCTGGGGAAT GCTCAATCGC AAAAAAGCGC AAACCATTTC CGATGACGCC
ATTAAGTTGC TCAACATTCG CGTGCCTCAT GCCCAGGTAC GCGCGGGCGG GCTTTCCGGC
GGCAATCAGC AAAAACTGTT GATCTCCCGC TGGGTGGCGA TTGGCCCGCG CATTTTACTG
CTCGATGAAC CCGCCCGCGG CGTGGACGTT GGCGCCAAAA GTGAGATCTA CCGGATCATG
AACGAGATGG CGCGCAAGGG CGTGGCGATC CTGATGATCT CCAGCGAACT GCCGGAGATA
GTGGGAATGA GCGATCGCGT CTATGTGATG CGCGAAGGCA GCATTGCCGG TGAGTTAAAC
GGCAAAAACA TCACCCAGGA AAACATTATG ACCTTAGCGA CTGGCGTCAA CGACGCCCAT
TCCCAGGCGG TAACCTCATG A
 
Protein sequence
MNSTPVLEMR NIAKAFGKFY ALKGVDLTVY PGEIHALMGE NGAGKSTLMK VLAGAYTATS 
GEILIDGKPF HIRTPKDALS AGITLIYQEM QLAPNLSVAE NIFLGSELSH GGLVQRKEML
VQAQKVIDRL GAQFNASDKV MTLTIAEQQQ VEIARALHRN SRILVMDEPT AALSSRETHR
LFELIMRLRD EGMAIIYISH RMAEVYELSD RVSVLRDGQY VGSLTRDNLN AGELVRMMVG
RPLSDLFNKE RDIPLGKARL NVHHLTDGGK VQPSSLLVRS GEIVGLAGLV GAGRSELAQL
IFGVRKATGG MIEVDGEPVV IHSPREAIDL GIGFLTENRK EQGLFLEMAA AENITMATLE
RDARWGMLNR KKAQTISDDA IKLLNIRVPH AQVRAGGLSG GNQQKLLISR WVAIGPRILL
LDEPARGVDV GAKSEIYRIM NEMARKGVAI LMISSELPEI VGMSDRVYVM REGSIAGELN
GKNITQENIM TLATGVNDAH SQAVTS