Gene ECH74115_3256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3256 
Symbol 
ID6968401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2988799 
End bp2989716 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content53% 
IMG OID643387069 
ProductABC transporter, quaternary amine uptake transporter (QAT) family, substrate-binding protein 
Protein accessionYP_002271533 
Protein GI209399202 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.00281885 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCACTCT CAAAGGTCTG GGCAGGTTCA CTGGTTTTGT TGGCAGCCGT GAGCCTGCCG 
CTGCACGCGG CTTCCCCCGT TAAAGTCGGT TCAAAAATCG ATACCGAAGG CGCGCTGCTC
GGCAATATCA TTTTGCAAGT ACTCGAAAGC CACGGAGTAC CAACGGTCAA TAAAGTGCAA
CTTGGAACGA CTCCTGTGGT GCGCGGGGCG ATTACTTCCG GTGAACTGGA TATCTATCCG
GAATATACCG GCAATGGCGC GTTTTTCTTT AAAGATGAAA ACGATGCAGC GTGGAAAAAC
GCGCAGCAAG GTTACGAGAA AGTCAAAAAA CTCGATGCAG AGCAAAACAA GTTAATCTGG
CTGACGCCCG CACCTGCAAA TAACACCTGG ACCATCGCCG TGCGTCAGGA TGTGGCAGAG
AAAAACAAAC TCACTTCGCT TGCTGACCTG AGGCGTTATC TGAAAGAGGG CGGCACCTTC
AAACTGGCAG CCTCGGCTGA GTTTATCGAA CGCGCCGATG CGTTACCCGC GTTTGAAAAA
GCCTACGACT TTAAACTCGA TCAGGATCAG TTACTGTCAC TGGCTGGCGG CGACACGGCG
GTAACGATTA AAGCCGCTGC CCAGCAAACT TCTGGCGTTA ATGCCGCAAT GGCTTACGGC
ACTGACGGTC CGGTCGCGGC GCAGGGGCTG CAAACCTTAA GCGATCCGCA AGGCGTGCAA
CCTATCTACG CGCCTGCACC AGTGGTGCGT GAGTCGGTGC TGAAAGAGTA TCCGCAAATG
GCACAGTGGC TACAGCCAGT CTTCGCCAGC CTCGATGCAA AAACATTGCA GCAACTGAAT
GCCAGCATTG CAGTGGAAGG ACTGGATGCC AAAAAAGTGG CTGCCGACTA CTTGAAACAA
AAAGGGTGGA CGAAGTAA
 
Protein sequence
MPLSKVWAGS LVLLAAVSLP LHAASPVKVG SKIDTEGALL GNIILQVLES HGVPTVNKVQ 
LGTTPVVRGA ITSGELDIYP EYTGNGAFFF KDENDAAWKN AQQGYEKVKK LDAEQNKLIW
LTPAPANNTW TIAVRQDVAE KNKLTSLADL RRYLKEGGTF KLAASAEFIE RADALPAFEK
AYDFKLDQDQ LLSLAGGDTA VTIKAAAQQT SGVNAAMAYG TDGPVAAQGL QTLSDPQGVQ
PIYAPAPVVR ESVLKEYPQM AQWLQPVFAS LDAKTLQQLN ASIAVEGLDA KKVAADYLKQ
KGWTK