Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2473 |
Symbol | |
ID | 6969544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2341879 |
End bp | 2343045 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643386342 |
Product | putative ABC transporter solute-binding protein |
Protein accession | YP_002270824 |
Protein GI | 209399720 |
COG category | [R] General function prediction only |
COG ID | [COG4134] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCATT GTGGGTGGTT GCTGGGATTG TTATCGCTGT TTTCTCTGGC AACACATGCC AGTGACTGGC AAGAAATTAA AAATGAGGCC AAAGGGCAAA CTGTCTGGTT TAACGCCTGG GGCGGCGATA CCGCAATTAA CCGCTATCTC GACTGGGTTA GCGGCGAGAT GAAAACCCAT TACGCTATAA ACCTGAAGAT TGTTCGTCTG GCGGATGCCG CAGACGCGGT GAAGCGCATT CAGACCGAAG CTGCTGCCGG ACGTAAAACG GGCGGCTCGG TGGATCTGCT CTGGGTGAAC GGCGAAAACT TCCGCACCTT AAAAGAGGCC AATTTACTGC AAACCGACTG GGCAGAGACT CTGCCCAACT GGCGCTATGT CGACACACAG CTGCCGGTGC GGGAAGATTT TTCAGTGCCT ACAGAAGGGG CTGAATCGCC CTGGGGGGGC GCACAACTGA CATTTATCGC CCGCCGCGAT GTTACGCCAC AGCCACCACA AACGCCGCAA GCCTTACTGG AGTTTGCTAA AGCCAATCCC GGCACGGTTA CCTATCCGCG CCCACCGGAC TTTACCGGCA CGGCGTTTCT TGAACAGTTG CTGATTATGC TGACGCCCGA TCCCGCCGCA TTAAATGAAG CGCCGAACGA TGCGACTTTC GCCCGTGTCA CTGCTCCCTT GTGGCAATAT CTTGATGCGC TGCATCCGTA TTTGTGGCGC GAAGGAAAGG ATTTCCCGCC TTCGCCCGCG CGGATGGATG CTCTGCTGAA AGCCGGAACA TTTCGCCTGT CGCTGACCTT TAACCCCGCG CATGCGCAGC AAAAAATCGC CAGCGGCGAT TTGCCTGCAA GCAGTTACAG TTTTGGCTTT CGCGAGGGGA TGATTGGCAA CGTGCATTTC GTCACCATTC CTGCCAACGC GAATGCCAGT GCTGCGGCGA AGGTAGTTGC CAATTTCTTG CTCTCACCCG ATGCGCAATT GCGTAAAGCA GATCCCGCTG TCTGGGGCGA TCCTTCTGTT CTCGATCCGC AAAAACTGCC TGATGGGCAG CGCGAAACAT TGCAATCAAG AATGCCGCAA GATCTGCCGC CGGTACTGGC TGAACCGCAC GCAGGTTGGG TAAATGCGCT GGAACAAGAA TGGCTACACC GTTACGGTAC GCATTAA
|
Protein sequence | MRHCGWLLGL LSLFSLATHA SDWQEIKNEA KGQTVWFNAW GGDTAINRYL DWVSGEMKTH YAINLKIVRL ADAADAVKRI QTEAAAGRKT GGSVDLLWVN GENFRTLKEA NLLQTDWAET LPNWRYVDTQ LPVREDFSVP TEGAESPWGG AQLTFIARRD VTPQPPQTPQ ALLEFAKANP GTVTYPRPPD FTGTAFLEQL LIMLTPDPAA LNEAPNDATF ARVTAPLWQY LDALHPYLWR EGKDFPPSPA RMDALLKAGT FRLSLTFNPA HAQQKIASGD LPASSYSFGF REGMIGNVHF VTIPANANAS AAAKVVANFL LSPDAQLRKA DPAVWGDPSV LDPQKLPDGQ RETLQSRMPQ DLPPVLAEPH AGWVNALEQE WLHRYGTH
|
| |