Gene ECH74115_2971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2971 
Symbol 
ID6966930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2748053 
End bp2749438 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content31% 
IMG OID643386811 
ProductO antigen flippase 
Protein accessionYP_002271279 
Protein GI209400642 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.436606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.000000132287 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATAAAA TCAAAAAAAT ACTTAAATTT TGCACTTTAA AAAAATATGA TACATCAAGT 
GCTTTAGGTA GAGAACAGGA AAGGTACAGG ATTATATCCT TGTCTGTTAT TTCAAGTTTG
ATTAGTAAAA TACTCTCACT ACTTTCTCTT ATATTAACTG TAAGTTTAAC TTTACCTTAT
TTAGGACAAG AGAGATTTGG TGTATGGATG ACTATTACCA GTCTTGGTGC TGCTCTGACA
TTTTTGGACT TAGGTATAGG AAATGCATTA ACAAACAGGA TCGCACATTC ATTTGCGTGT
GGCAAAAATT TAAAGATGAG TCGGCAAATT AGTGGTGGGC TCACTTTGCT GGCTGGATTA
TCGTTTGTCA TAACTGCAAT ATGCTATATT ACTTCTGGCA TGATTGATTG GCAACTAGTA
ATAAAAGGTA TAAACGAGAA TGTGTATGCA GAGTTACAAC ACTCAATTAA AGTCTTTGTA
ATCATATTTG GACTTGGAAT TTATTCAAAT GGTGTGCAAA AAGTTTATAT GGGAATACAA
AAAGCCTATA TAAGTAATAT TGTTAATGCC ATATTTATAT TGTTATCTAT TATTACTCTA
GTAATATCGT CGAAACTACA TGCGGGACTA CCAGTTTTAA TTGTCAGCAC TCTTGGTATT
CAATACATAT CGGGAATCTA TTTAACAATT AATCTTATTA TAAAGCGATT AATAAAGTTT
ACAAAAGTTA ACATACATGC TAAAAGAGAA GCTCCATATT TGATATTAAA CGGTTTTTTC
TTTTTTATTT TACAGTTAGG CACTCTGGCA ACATGGAGTG GTGATAACTT TATAATATCT
ATAACATTGG GTGTTACTTA TGTTGCTGTT TTTAGCATTA CACAGAGATT ATTTCAAATA
TCTACGGTCC CTCTTACGAT TTATAACATC CCGTTATGGG CTGCTTATGC AGATGCTCAT
GCACGCAATG ATACTCAATT TATAAAAAAG ACGCTCAGAA CATCATTGAA AATAGTGGGT
ATTTCATCAT TCTTATTGGC CTTCATATTA GTAGTGTTCG GTAGTGAAGT CGTTAATATT
TGGACAGAAG GAAAGATTCA GGTACCTCGA ACATTCATAA TAGCTTATGC TTTATGGTCT
GTTATTGATG CTTTTTCGAA TACATTTGCA AGCTTTTTAA ATGGTTTGAA CATAGTTAAA
CAACAAATGC TTGCTGTTGT AACATTGATA TTGATCGCAA TTCCAGCAAA ATACATCATA
GTTAGCCATT TTGGGTTAAC TGTTATGTTG TACTGCTTCA TTTTTATATA TATTGTAAAT
TACTTTATAT GGTATAAATG TAGTTTTAAA AAACATATCG ATAGACAGTT AAATATAAGA
GGATGA
 
Protein sequence
MNKIKKILKF CTLKKYDTSS ALGREQERYR IISLSVISSL ISKILSLLSL ILTVSLTLPY 
LGQERFGVWM TITSLGAALT FLDLGIGNAL TNRIAHSFAC GKNLKMSRQI SGGLTLLAGL
SFVITAICYI TSGMIDWQLV IKGINENVYA ELQHSIKVFV IIFGLGIYSN GVQKVYMGIQ
KAYISNIVNA IFILLSIITL VISSKLHAGL PVLIVSTLGI QYISGIYLTI NLIIKRLIKF
TKVNIHAKRE APYLILNGFF FFILQLGTLA TWSGDNFIIS ITLGVTYVAV FSITQRLFQI
STVPLTIYNI PLWAAYADAH ARNDTQFIKK TLRTSLKIVG ISSFLLAFIL VVFGSEVVNI
WTEGKIQVPR TFIIAYALWS VIDAFSNTFA SFLNGLNIVK QQMLAVVTLI LIAIPAKYII
VSHFGLTVML YCFIFIYIVN YFIWYKCSFK KHIDRQLNIR G