Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5601 |
Symbol | |
ID | 6972069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5240020 |
End bp | 5241540 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643389237 |
Product | ribose import ATP-binding protein RbsA |
Protein accession | YP_002273634 |
Protein GI | 209398733 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCA CGCCAGTTCT GGAGATGCGC AATATTGCCA AAGCCTTCGG CAAATTTTAT GCACTCAAAG GGGTGGATTT GACGGTCTAC CCTGGCGAGA TCCATGCCCT GATGGGGGAA AACGGCGCGG GAAAAAGCAC GCTGATGAAG GTGCTGGCGG GGGCGTATAC CGCCACCAGC GGCGAGATTC TCATAGACGG CAAGCCCTTT CACATTCGCA CGCCAAAAGA TGCCTTAAGC GCCGGTATTA CGCTGATTTA TCAGGAGATG CAGCTGGCAC CGAATCTGTC GGTGGCAGAA AATATTTTTC TCGGCAGCGA GCTTTCCCAC GGCGGGCTGG TGCAGCGTAA AGAGATGCTA GTGCAGGCGC AAAAAGTGAT CGACCGCCTC GGCGCGCAGT TTAACGCCAG CGATAAGGTC ATGACGCTGA CCATTGCCGA GCAACAGCAG GTCGAAATCG CCCGCGCACT ACATCGCAAC AGCCGCATTC TGGTGATGGA CGAACCCACC GCTGCCCTCT CCTCCCGCGA AACTCACCGC CTGTTTGAAC TGATTATGCG GCTGCGCGAT GAAGGGATGG CGATTATCTA CATTAGCCAC CGCATGGCGG AAGTGTATGA GCTTTCCGAT CGCGTCAGCG TGCTACGCGA CGGGCAATAC GTTGGCAGCC TGACCCGCGA TAACCTCAAT GCCGGGGAGC TGGTGCGGAT GATGGTCGGC AGGCCACTGA GCGATCTGTT CAATAAAGAG CGCGATATCC CGCTCGGTAA AGCCCGCCTG AATGTTCACC ACCTGACCGA CGGCGGCAAA GTCCAGCCGA GTAGCCTGCT GGTGCGTTCC GGCGAAATTG TTGGCCTCGC CGGACTGGTG GGTGCCGGAC GTTCCGAACT GGCGCAGTTG ATCTTCGGCG TGCGGAAAGC GACAGGCGGA ATGATTGAAG TCGATGGTGA ACCGGTGGTG ATCCACTCCC CGCGCGAAGC CATCGATCTT GGCATTGGTT TTCTCACCGA AAACCGCAAA GAACAAGGCT TATTCCTTGA AATGGCAGCA GCCGAAAACA TCACCATGGC AACCCTGGAG CGCGATGCCC GCTGGGGAAT GCTCAATCGC AAAAAAGCGC AAACCATTTC CGATGACGCC ATTAAGTTGC TCAACATTCG CGTGCCTCAT GCCCAGGTAC GCGCGGGCGG GCTTTCCGGC GGCAATCAGC AAAAACTGTT GATCTCCCGC TGGGTGGCGA TTGGCCCGCG CATTTTACTG CTCGATGAAC CCGCCCGCGG CGTGGACGTT GGCGCCAAAA GTGAGATCTA CCGGATCATG AACGAGATGG CGCGCAAGGG CGTGGCGATC CTGATGATCT CCAGCGAACT GCCGGAGATA GTGGGAATGA GCGATCGCGT CTATGTGATG CGCGAAGGCA GCATTGCCGG TGAGTTAAAC GGCAAAAACA TCACCCAGGA AAACATTATG ACCTTAGCGA CTGGCGTCAA CGACGCCCAT TCCCAGGCGG TAACCTCATG A
|
Protein sequence | MNSTPVLEMR NIAKAFGKFY ALKGVDLTVY PGEIHALMGE NGAGKSTLMK VLAGAYTATS GEILIDGKPF HIRTPKDALS AGITLIYQEM QLAPNLSVAE NIFLGSELSH GGLVQRKEML VQAQKVIDRL GAQFNASDKV MTLTIAEQQQ VEIARALHRN SRILVMDEPT AALSSRETHR LFELIMRLRD EGMAIIYISH RMAEVYELSD RVSVLRDGQY VGSLTRDNLN AGELVRMMVG RPLSDLFNKE RDIPLGKARL NVHHLTDGGK VQPSSLLVRS GEIVGLAGLV GAGRSELAQL IFGVRKATGG MIEVDGEPVV IHSPREAIDL GIGFLTENRK EQGLFLEMAA AENITMATLE RDARWGMLNR KKAQTISDDA IKLLNIRVPH AQVRAGGLSG GNQQKLLISR WVAIGPRILL LDEPARGVDV GAKSEIYRIM NEMARKGVAI LMISSELPEI VGMSDRVYVM REGSIAGELN GKNITQENIM TLATGVNDAH SQAVTS
|
| |