Gene ECH74115_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4149 
Symbol 
ID6972017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3836831 
End bp3838534 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content39% 
IMG OID643387897 
Producttype III secretion outer membrane pore, YscC/HrcC family 
Protein accessionYP_002272337 
Protein GI209395847 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02516] type III secretion outer membrane pore, YscC/HrcC family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0153519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA AATTACGCAT TACTATTATA TTAATTTCAG TCTTATGCAT TTTTAATGGA 
TTATTGACTC CTGGTGCATA TGCCGCAGCA GCGAATGGAT ACGTAGCAAA TAAAGAAAAT
CTACGTAGTT TTTTTGAAAC GGTTTCATCA TATGCGGGTA AGCCTACAAT AGTTAGTAAG
CTTGCCATGA AGAAACAGAT CAGCGGAAAT TTTGATTTAA CAGAACCCTA TGCCTTGATT
GAACGCCTGT CGGCACAGAT GGGGCTTATT TGGTATGATG ATGGCAAAGC TATTTATATC
TATGACTCAT CAGAGATGCG TAATGCATTG ATTAATCTGA GAAAGGTATC GACGAATGAG
TTTAATAATT TCTTAAAAAA ATCGGGTCTG TATAACTCTC GTTATGAAAT TAAAGGGGAC
GGCAATGGTA CTTTCTATGT ATCGGGGCCA CCGGTTTATG TTGACCTGGT CGTTAATGCT
GCGAAATTAA TGGAGCAAAA CAGCGATGGC ATTGAGATCG GTCGAAATAA AGTAGGAATA
ATTCATCTGG TCAACACATT CGTTAATGAC CGGACTTATG AGTTACGTGG CGAAAAGATA
GTTATTCCTG GCATGGCAAA AGTACTATCG ACGTTGTTAA ATAATAACAT TAAGCAAAGC
ACAGGGGTGA ACGTACTTTC TGAAATTTCA AGTCGTCAAC AACTAAAAAA CGTATCACGT
ATGCCTCCTT TTCCTGGGGC TGAAGAAGAT GATGACCTAC AAGTAGAGAA AATAATATCG
ACAGCAGGAG CGCCAGAGAC TGACGATATA CAAATTATTG CATATCCTGA TACCAACAGT
TTACTTGTTA AAGGAACAGT ATCTCAGGTT GATTTTATCG AGAAGTTAGT AGCTACGCTT
GATATTCCAA AGCGACACAT TGAATTATCT TTATGGATAA TAGATATTGA TAAGACTGAT
CTAGAACAGC TAGGTGCTGA CTGGTCGGGC ACGATAAAAA TTGGATCCAG CTTAAGTGCA
TCATTTAATA ATTCAGGTTC AATTAGTACC CTTGATGGGA CACAGTTTAT TGCAACAATA
CAGGCACTTG CACAAAAAAG AAGAGCCGCC GTTGTTGCTC GTCCCGTCGT TTTAACACAA
GAGAATATTC CTGCTATATT TGATAATAAC CGCACTTTCT ATACTAAATT AGTCGGCGAG
CGTACTGCGG AATTGGATGA AGTAACCTAT GGTACGATGA TTAGCGTGCT ACCACGATTT
GCTGCGCGAA ATCAAATTGA ATTACTCTTG AATATTGAAG ATGGAAATGA AATCAATTCC
GACAAGACCA ATGTTGATGA TCTGCCTCAG GTTGGTAGAA CACTTATAAG TACTATTGCA
CGCGTGCCCC AAGGGAAAAG TCTTTTGATT GGGGGATATA CACGCGATAC GAATACCTAT
GAAAGTCGCA AGATCCCAAT ATTAGGCAGC ATTCCATTTA TTGGTAAATT ATTTGGTTAT
GAAGGAACAA ATGCGAATAA CATCGTTCGT GTCTTTCTTA TTGAGCCAAG AGAGATCGAT
GAACGCATGA TGAATAATGC GAATGAGGCT GCGGTTGATG CCAGAGCGAT TACACAGCAA
ATGGCAAAAA ATAAAGAAAT CAACGATGAA TTACTGCAGA AATGGATAAA AACTTACCTG
AATCGTGAAG TCGTCGGAGG ATAG
 
Protein sequence
MKIKLRITII LISVLCIFNG LLTPGAYAAA ANGYVANKEN LRSFFETVSS YAGKPTIVSK 
LAMKKQISGN FDLTEPYALI ERLSAQMGLI WYDDGKAIYI YDSSEMRNAL INLRKVSTNE
FNNFLKKSGL YNSRYEIKGD GNGTFYVSGP PVYVDLVVNA AKLMEQNSDG IEIGRNKVGI
IHLVNTFVND RTYELRGEKI VIPGMAKVLS TLLNNNIKQS TGVNVLSEIS SRQQLKNVSR
MPPFPGAEED DDLQVEKIIS TAGAPETDDI QIIAYPDTNS LLVKGTVSQV DFIEKLVATL
DIPKRHIELS LWIIDIDKTD LEQLGADWSG TIKIGSSLSA SFNNSGSIST LDGTQFIATI
QALAQKRRAA VVARPVVLTQ ENIPAIFDNN RTFYTKLVGE RTAELDEVTY GTMISVLPRF
AARNQIELLL NIEDGNEINS DKTNVDDLPQ VGRTLISTIA RVPQGKSLLI GGYTRDTNTY
ESRKIPILGS IPFIGKLFGY EGTNANNIVR VFLIEPREID ERMMNNANEA AVDARAITQQ
MAKNKEINDE LLQKWIKTYL NREVVGG