Gene ECH74115_2473 details


Gene Information

Locus tag: ECH74115_2473
Symbol:
ID: 6969544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2341879
End bp: 2343045
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 55%
IMG OID: 643386342
Product: putative ABC transporter solute-binding protein
Protein accession: YP_002270824
Protein GI: 209399720
COG category: [R] General function prediction only
COG ID: [COG4134] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
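
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2341879
end_bp = 2343045

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1167 bp) and Protein Length (388 aa).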


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCATT GTGGGTGGTT GCTGGGATTG TTATCGCTGT TTTCTCTGGC AACACATGCC 
AGTGACTGGC AAGAAATTAA AAATGAGGCC AAAGGGCAAA CTGTCTGGTT TAACGCCTGG
GGCGGCGATA CCGCAATTAA CCGCTATCTC GACTGGGTTA GCGGCGAGAT GAAAACCCAT
TACGCTATAA ACCTGAAGAT TGTTCGTCTG GCGGATGCCG CAGACGCGGT GAAGCGCATT
CAGACCGAAG CTGCTGCCGG ACGTAAAACG GGCGGCTCGG TGGATCTGCT CTGGGTGAAC
GGCGAAAACT TCCGCACCTT AAAAGAGGCC AATTTACTGC AAACCGACTG GGCAGAGACT
CTGCCCAACT GGCGCTATGT CGACACACAG CTGCCGGTGC GGGAAGATTT TTCAGTGCCT
ACAGAAGGGG CTGAATCGCC CTGGGGGGGC GCACAACTGA CATTTATCGC CCGCCGCGAT
GTTACGCCAC AGCCACCACA AACGCCGCAA GCCTTACTGG AGTTTGCTAA AGCCAATCCC
GGCACGGTTA CCTATCCGCG CCCACCGGAC TTTACCGGCA CGGCGTTTCT TGAACAGTTG
CTGATTATGC TGACGCCCGA TCCCGCCGCA TTAAATGAAG CGCCGAACGA TGCGACTTTC
GCCCGTGTCA CTGCTCCCTT GTGGCAATAT CTTGATGCGC TGCATCCGTA TTTGTGGCGC
GAAGGAAAGG ATTTCCCGCC TTCGCCCGCG CGGATGGATG CTCTGCTGAA AGCCGGAACA
TTTCGCCTGT CGCTGACCTT TAACCCCGCG CATGCGCAGC AAAAAATCGC CAGCGGCGAT
TTGCCTGCAA GCAGTTACAG TTTTGGCTTT CGCGAGGGGA TGATTGGCAA CGTGCATTTC
GTCACCATTC CTGCCAACGC GAATGCCAGT GCTGCGGCGA AGGTAGTTGC CAATTTCTTG
CTCTCACCCG ATGCGCAATT GCGTAAAGCA GATCCCGCTG TCTGGGGCGA TCCTTCTGTT
CTCGATCCGC AAAAACTGCC TGATGGGCAG CGCGAAACAT TGCAATCAAG AATGCCGCAA
GATCTGCCGC CGGTACTGGC TGAACCGCAC GCAGGTTGGG TAAATGCGCT GGAACAAGAA
TGGCTACACC GTTACGGTAC GCATTAA
 
Protein sequence
MRHCGWLLGL LSLFSLATHA SDWQEIKNEA KGQTVWFNAW GGDTAINRYL DWVSGEMKTH 
YAINLKIVRL ADAADAVKRI QTEAAAGRKT GGSVDLLWVN GENFRTLKEA NLLQTDWAET
LPNWRYVDTQ LPVREDFSVP TEGAESPWGG AQLTFIARRD VTPQPPQTPQ ALLEFAKANP
GTVTYPRPPD FTGTAFLEQL LIMLTPDPAA LNEAPNDATF ARVTAPLWQY LDALHPYLWR
EGKDFPPSPA RMDALLKAGT FRLSLTFNPA HAQQKIASGD LPASSYSFGF REGMIGNVHF
VTIPANANAS AAAKVVANFL LSPDAQLRKA DPAVWGDPSV LDPQKLPDGQ RETLQSRMPQ
DLPPVLAEPH AGWVNALEQE WLHRYGTH
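
For readers who want to relate the gene sequence to the protein sequence and the GC content field, here is a minimal Python sketch. It assumes the sequence is pasted as plain text with the 10-base blocks separated by whitespace, and it uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in the alternative start codons it permits; this gene begins with ATG, so no special handling is included). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tool.

```python
# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First half-line of the gene sequence above.
print(translate("ATGCGCCATT GTGGGTGGTT GCTGGGATTG"))
```

The printed residues correspond to the first block of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.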