Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3315 |
Symbol | |
ID | 6968255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3048656 |
End bp | 3050470 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387127 |
Product | ABC transporter, periplasmic solute-binding protein |
Protein accession | YP_002271591 |
Protein GI | 209399185 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.644955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.133502 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGTGC GCATACTGCT GCTGTTTATC GCTCTGTTCA CCTTTGGTGC GCAGGCGCAG GCTATCAAGG AAAGCTATGC TTTTGCCGTG CTGGGCGAAC CCCGGTACGC GTTTAATTTC AACCATTTTG ATTATGTGAA CCCCGCCGCG CCAAAAGGTG GGCAGATAAC GTTGTCAGCC CTCGGTACCT TCGATAATTT CAACCGCTAT GCACTGCGCG GCAACCCGGG CGCACGCACC GAGCAGCTGT ACGACACGCT ATTTACGACT TCCGATGACG AACCAGGCAG TTATTACCCG CTGATTGCTG AAAGCGCACG CTATGCTGAC GATTATTCCT GGGTGGAGGT CGCTATTAAT CCTCGCGCCC GTTTTCATGA TGGTTCGCCC ATTACTGCCC GCGATGTAGA GTTTACTTTT CAAAAATTTA TGACTGAAGG CGTGCCGCAA TTTCGTCTGG TCTACAAAGG CACCACCGTC AAAGCCATTG CGCCGTTAAC CGTGCGCATT GAGTTAGCTA AACCCGGCAA AGAAGATATG TTGAGTCTGT TTTCGCTGCC GGTATTTCCA GAAAAGTACT GGAAGGATCA CAAACTTAGC GATCCGCTCG CCACGCCTCC GCTTGCCAGT GGTCCGTACC GAATTACGTC CTGGAAAATG GGGCAAAATA TTGTCTATTC CCGCGTGAAA GATTACTGGG CAGCAAACTT ACCGGTAAAC CGTGGACGCT GGAATTTCGA CACCATTCGC TACGATTATT ACCTCGATGA TAATGTCGCC TTTGAAGCGT TTAAAGCAGG TGCCTTTGAT TTGCGTATGG AAAACGACGC CAAAAACTGG GCCACACGCT ATAGCGGTAA AAATTTCGAT AAAAAATACA TCATCAAAGA TGAGCAAAAG AACGAATCAG CCCAGGATAC GCGCTGGCTG GCGTTTAATA TCCAACGTCC GGTATTCAGC GATCGCCGGG TCCGGGAAGC AATCACCCTC GCCTTTGACT TTGAATGGAT GAACAAGGCG TTGTTTTACA ATGCCTGGAG TCGCACGAAC AGTTATTTTC AGAATACCGA ATACGCGGCC AGAAATTACC CCGACGCCGC GGAGCTGGTG CTTCTGGCAC CAATGAAAAA AGATTTACCG CCAGAAGTCT TCACACAAAT CTACCAGCCG CCGGTATCGA AAGGCGATGG CTACGATCGT GACAACCTGT TAAAAGCCGA CAAACTTCTC AACGAAGCGG GCTGGGTGCT GAAAGGTCAG CAACGCGTTA ATGCCACTAC GGGTCAGCCG CTCAGCTTTG AATTATTGCT TCCTGCAAGC AGCAATAGTC AGTGGGTATT GCCGTTCCAG CACAGCCTGC AACGGCTGGG TATCAACATG GACATTCGCA AGGTGGATCA CTCGCAAATC ACCAACCGCA TGCGCAGTCG CGACTATGAC ATGATGCCGC GCCTATGGCG GGCGATGCCG TGGCCCAGTT CCGATTTACA GATTTCCTGG TCATCGGAAT ATATCAATTC CACTTATAAT GCCCCCGGCG TGCAAAGCCC GGTTATCGAC TCGCTGATCA ACCAAATTAT TGCCGCGCAG GGAAATAAAG AAAAATTACT GCCGTTAGGG CGAGCACTGG ATCGCGTATT AACGTGGAAT TATTACATGC TGCCAATGTG GTACATGGCG GAAGACCGTC TCGCCTGGTG GGATAAATTC TCCCAACCCG CTGTGCGCCC TGTTTACAGT CTGGGTATCG ATACCTGGTG GTATGATGTT AATAAAGCGG CCAAACTGCC GTCAGCCAGA CAACAGGGAG AGTAG
|
Protein sequence | MIVRILLLFI ALFTFGAQAQ AIKESYAFAV LGEPRYAFNF NHFDYVNPAA PKGGQITLSA LGTFDNFNRY ALRGNPGART EQLYDTLFTT SDDEPGSYYP LIAESARYAD DYSWVEVAIN PRARFHDGSP ITARDVEFTF QKFMTEGVPQ FRLVYKGTTV KAIAPLTVRI ELAKPGKEDM LSLFSLPVFP EKYWKDHKLS DPLATPPLAS GPYRITSWKM GQNIVYSRVK DYWAANLPVN RGRWNFDTIR YDYYLDDNVA FEAFKAGAFD LRMENDAKNW ATRYSGKNFD KKYIIKDEQK NESAQDTRWL AFNIQRPVFS DRRVREAITL AFDFEWMNKA LFYNAWSRTN SYFQNTEYAA RNYPDAAELV LLAPMKKDLP PEVFTQIYQP PVSKGDGYDR DNLLKADKLL NEAGWVLKGQ QRVNATTGQP LSFELLLPAS SNSQWVLPFQ HSLQRLGINM DIRKVDHSQI TNRMRSRDYD MMPRLWRAMP WPSSDLQISW SSEYINSTYN APGVQSPVID SLINQIIAAQ GNKEKLLPLG RALDRVLTWN YYMLPMWYMA EDRLAWWDKF SQPAVRPVYS LGIDTWWYDV NKAAKLPSAR QQGE
|
| |