Gene ECH74115_B0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_B0020 
Symbol 
ID6966412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011350 
Strand
Start bp5883 
End bp7322 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content37% 
IMG OID643383927 
Productleukotoxin secretion protein D 
Protein accessionYP_002268406 
Protein GI209395593 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR01843] type I secretion membrane fusion protein, HlyD family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTTT ACATGAAAGG TTTATGGGAT TTGGTTTGCC GTTATAAGAC GGTTTTTTCT 
GATGTCTGGA AAATTCGTCA CACACTTGAT GCTCCTGTGA GAGAAAAAGA TGAGTATGCA
TTTCTTCCTG CTCACCTTGA ACTCATTGAA ACACCAGTAT CCAGACGTTC TCATTTTGTT
GTATGGAGTA TTTTATTATT TGTAATTATA TCTCTTCTCT TATCTGTTCT TGGGAAAGTT
GAAGTGGTTT CAGTAGCAAA TGGTAAGTTT ACTCATAGTG GAAGAAGCAA AGAAATAAAA
CCGATAGAGA ATGCGATTGT AGAAAAAATA ATGGTAAAAG ATGGCTCTTT TGTAAAAAAA
AATGATCCAT TAGTTGAATT GACGGTGCCT GGTGTTGAAT CTGATATTTT AAAATCAGAA
GCATCTTTGT TGTATGAAAA AACAGAACAA TATCGCTACG CAATTCTTTC TGAATCAATA
CAACGGAATG AGCTTCCTGA AATAAGGATA ACAGATTTTC CTGGCGGAGA AGATAATGCG
GGAGGTGAAC ATTTTCAGAG GGTTAGCTCA TTAATAAAAG AGCAATTCAT GACATGGCAG
AACAGAAAAA ATCAAAAGCA GATCACATTA AATAAAAAAA TAGTTGAACG GGATGCAGCG
CTTGCTCGTG TTAGCCTGTA TGAGCATCAG GTATCACAGG AAGGAAGAAA ACTCAATGAT
TTTAAGTATT TGTTGAATAA AAAAGCTGTT TCTCAACATT CAGTTATGGA GCAGGAAAAT
AGCTATATTC AGGCAAAAAA TGAACATGCA GTCTGGCTTG CACAGGTTTC TCAACTTGAA
AAAGAAATAG AACTTGTGCG GGAAGAACTG GCACTGGAGA CGAATATCTT CAGAAGTGAA
ATTATCGAGA AGCACAGAAA ATCAACAGAT AACATTGTGT TGCTGGAGCA TGAGCTTGAA
AAAAACAGGC AGAGAAAAGC ATCATCTTTT ATTAAAGCTC CTGTGAGTGG TACTGTTCAG
GAGTTAAATA TACATACAGA AGGTGGTGTT GTCACAACAG CAGAAACGCT GATGATTATT
GTCCCTGATA ATGATATTCT CGAAGTAACA GCATCTGTAC TTAACAAGGA TATCGGTTTT
ATACAACCTG GACAGGAGGT CGTTATTAAA GTAGATGCAT ATCCATATAC ACGTCATGGT
TACCTTACAG GGAAAGTAAA AAATATAACT GCAGATTCTG TTTCTGTTCC GGATACAGGG
CTTGTATTTA ACGTGATTAT ATCTGTTGAT CGGAATGATA TACAGGGAGA AAGAAAAAAA
ATCCCTGTTA CAGCTGGAAT GACTGTTATG GCAGAAATAA AAACCGGGGT GCGTAGTGTT
ATCAGCTATC TTCTTAGTCC ATTAAAGGAA ACTATTAATG AAAGTTTACG TGAACGTTAG
 
Protein sequence
MRFYMKGLWD LVCRYKTVFS DVWKIRHTLD APVREKDEYA FLPAHLELIE TPVSRRSHFV 
VWSILLFVII SLLLSVLGKV EVVSVANGKF THSGRSKEIK PIENAIVEKI MVKDGSFVKK
NDPLVELTVP GVESDILKSE ASLLYEKTEQ YRYAILSESI QRNELPEIRI TDFPGGEDNA
GGEHFQRVSS LIKEQFMTWQ NRKNQKQITL NKKIVERDAA LARVSLYEHQ VSQEGRKLND
FKYLLNKKAV SQHSVMEQEN SYIQAKNEHA VWLAQVSQLE KEIELVREEL ALETNIFRSE
IIEKHRKSTD NIVLLEHELE KNRQRKASSF IKAPVSGTVQ ELNIHTEGGV VTTAETLMII
VPDNDILEVT ASVLNKDIGF IQPGQEVVIK VDAYPYTRHG YLTGKVKNIT ADSVSVPDTG
LVFNVIISVD RNDIQGERKK IPVTAGMTVM AEIKTGVRSV ISYLLSPLKE TINESLRER