Gene ECH74115_4147 details


Gene Information

Locus tag: ECH74115_4147
Symbol:
ID: 6967062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3833628
End bp: 3835688
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 37%
IMG OID: 643387895
Product: type III secretion protein, HrcV family
Protein accession: YP_002272335
Protein GI: 209398095
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4789] Type III secretory pathway, component EscV
TIGRFAM ID: [TIGR01399] type III secretion protein, HrcV family
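
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name `check_record` is made up for this illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3833628
END_BP = 3835688
GENE_LENGTH_BP = 2061
PROTEIN_LENGTH_AA = 686


def check_record(start, end, gene_len, protein_len):
    """Return True if the coordinates and lengths agree with each other.

    Assumptions (not stated in the record):
      * start/end are 1-based, inclusive coordinates
      * the CDS includes exactly one terminal stop codon
    """
    span = end - start + 1           # inclusive span on the replicon
    whole_codons = gene_len % 3 == 0  # a CDS should be a multiple of 3
    coding_codons = gene_len // 3 - 1  # drop the stop codon
    return span == gene_len and whole_codons and coding_codons == protein_len


print(END_BP - START_BP + 1)   # 2061
print(GENE_LENGTH_BP // 3 - 1)  # 686
print(check_record(START_BP, END_BP, GENE_LENGTH_BP, PROTEIN_LENGTH_AA))  # True
```

Under these assumptions the record is self-consistent: the span equals the gene length, and 687 codons minus one stop codon gives the 686 aa protein.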


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0033977
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTAACA AAGTTTTGGT AGGGTTAAGG AGTCATCCAG AATTAATAAT TCTTGGATTA 
ATGGTGATGA TTATCGCCAT GCTCATAATT CCATTGCCCA CCTATTTAAT CGACTTTTTG
ATTGGGCTTA ACTTAACATT GGCAATACTT GTATTTTTAG GATCATTCTA TGTTGATAGG
ATCTTAAGCT TTTCTTCATT TCCATCAATC CTCCTTATCA CAACGTTATT TCGTTTGGCG
CTGGCGATCA GTACCAGTCG ACTTATACTT CTTGAGGCGG ATGCTGGAGA AATTATAACG
AGTTTTGGTG AGTTCGTTAT TGGAGATAGC CTGGTAGTTG GTTTTGTTAT TTTCTCCATT
GTTACTATTG TTCAGTTTAT CGTGATTACT AAAGGTTCGG AACGTGTGGC TGAAGTTGCT
GCCCGCTTTT CGCTTGACGG TATGCCAGGT AAACAAATGA GCATTGACGC CGATCTACGC
GCCGGGATCA TTGATGCAGA TTTAGCAAAA GAACGTCGTA GTGTATTAGA ACGAGAGAGC
CAGTTGTATG GTTCCTTTGA TGGTGCGATG AAATTTATCA AAGGTGATGC TATTGCAAAC
ATTATTATTA TCTTTGTTAA TATTATCGGT GGGCTATCAG TAGGAGTAGG CCAAAATGGA
ATGGATTTCT CAACAGCGTT GACTGTCTAC ACAATTCTTA CTGTCGGTGA TGGTCTTGTT
TCTCAAATTC CAGCTTTATT AATTGCTATT AGCGCAGGAT TTATCGTTAC GCGCGTAAAT
GGCGATAGTG ATAATATGGG GCAAAATATA ATGTCCCAGT TATTAAGTAA TTCTTTTGTC
ATTGTTGTGA CATGTGTCCT CGCGCTAAGT ATTGGGTTGC TTCCAGGTTT TCCCCTTCTA
GTGTTTTTAT GTTTGGCAGT AATCTTAGGA ATTTACTTTT ACTTTAAATT CAAAAAGAAA
GGGACTGAAG AAACGGTTGT TGAGGGAGAT ATTACCGTTG GGTTGGAACC CTATAATCAA
GATGATGATA TCTCATTAGG TATCATTAAT AAACTTGATC AGGTTATTAC CGAAACGGTA
CCCTTGGTCT TGATTATGAA TAGTGTACAG GCTAAAAAAT ACACTGAAAT CAATCTTGCT
GATAGGATTC GTAGTCAGTT TTTTATTGAG TATGGTATCC GCATTCCCGG TATTGTCATT
CGTGAGGGGG AAGGTCTTAA TGATGAAGAT GTCATATTAA TGCTAAATGA AGTCAGAGCT
TCACAATTTA AAATTTATCA TGATCTTGTC TTGTTGGTTG AATATTCAGA TGAAGTGGTT
TCGACTTTGA TCAAGAAACC TGTCATTGTA AATAGTAATG GTGAACAATA TTATTGGGTT
ACGAAAAGTG ATGCTCAAAA ATTAACAAAA ATTGGCTGTT ATACTCGCAC GGCCATGGAT
GAAATGTATA ATCATTTATC TGTCTGTCTG GCTCATAATA TTAATGAGTA CTTTGGTATA
CAAGAAACAA AATATATTCT TGATCAATTA GAGATGAAGT ATTCTGATCT ATTAAAAGAA
ATATTAAGAT ATATAACCGT GCAGCGCATT TCAGAGGTAA TTCAGCGTCT GATCCAGGAG
CGAATTTCAG TGCGCAATAT GCGGTTAGTT ATGGAGGCCT TAGCATTATG GAGCCCACGA
GAGAAAGATA TAATCACACT CGTCGAGCAT GTACGAGGTG CATTAGGTCG CTACATTTGC
CATAAGTTTT CATATTCTGG AGAAATTAAA GCAATTGTGA TTTCGCCTGA AATAGAGGAC
AGAATCAGAG ATGGTGTCAG GCCTACCGCT GGCGGTACTT TTCTTAATCT CGATGCATCC
GAAGCTGAGA TGATTCTGGA TAATTTTAAA CTGGCACTTT CTGGAATCAA TATTCCTATT
AAAGACATTA TTTTACTGGG ATCTGTGGAT ATTCGTCGTT TTATCAAAAA ACTTATTGAA
TCTAGTTATC GAGACCTCGA AGTACTTTCG TACGGGGAAC TAACGGAAAA TGTTCCCGTT
AATATTCTGA AAACTATTTA G
 
Protein sequence
MFNKVLVGLR SHPELIILGL MVMIIAMLII PLPTYLIDFL IGLNLTLAIL VFLGSFYVDR 
ILSFSSFPSI LLITTLFRLA LAISTSRLIL LEADAGEIIT SFGEFVIGDS LVVGFVIFSI
VTIVQFIVIT KGSERVAEVA ARFSLDGMPG KQMSIDADLR AGIIDADLAK ERRSVLERES
QLYGSFDGAM KFIKGDAIAN IIIIFVNIIG GLSVGVGQNG MDFSTALTVY TILTVGDGLV
SQIPALLIAI SAGFIVTRVN GDSDNMGQNI MSQLLSNSFV IVVTCVLALS IGLLPGFPLL
VFLCLAVILG IYFYFKFKKK GTEETVVEGD ITVGLEPYNQ DDDISLGIIN KLDQVITETV
PLVLIMNSVQ AKKYTEINLA DRIRSQFFIE YGIRIPGIVI REGEGLNDED VILMLNEVRA
SQFKIYHDLV LLVEYSDEVV STLIKKPVIV NSNGEQYYWV TKSDAQKLTK IGCYTRTAMD
EMYNHLSVCL AHNINEYFGI QETKYILDQL EMKYSDLLKE ILRYITVQRI SEVIQRLIQE
RISVRNMRLV MEALALWSPR EKDIITLVEH VRGALGRYIC HKFSYSGEIK AIVISPEIED
RIRDGVRPTA GGTFLNLDAS EAEMILDNFK LALSGINIPI KDIILLGSVD IRRFIKKLIE
SSYRDLEVLS YGELTENVPV NILKTI
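
The protein sequence can be compared with a translation of the gene sequence. The following is a minimal sketch in plain Python, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are made up for this illustration, and only the first and last sequence blocks are used as input.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, with codons enumerated in TCAG order.
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon (stop not included)."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGTTTAACA AAGTTTTGGT AGGGTTAAGG"))  # MFNKVLVGLR
print(translate("AATATTCTGA AAACTATTTA G"))           # NILKTI
```

The two outputs match the first ten and last six residues of the listed protein. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.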