Gene ECH74115_B0019 details


Gene Information

Locus tag: ECH74115_B0019
Symbol:
ID: 6966448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011350
Strand:
Start bp: 3759
End bp: 5879
Gene Length: 2121 bp
Protein Length: 706 aa
Translation table: 11
GC content: 37%
IMG OID: 643383926
Product: type I secretion system ATPase family protein
Protein accession: YP_002268405
Protein GI: 209395565
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID: [TIGR01846] type I secretion system ABC transporter, HlyB family
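
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the record itself: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function name `check_cds` and its return keys are hypothetical.

```python
def check_cds(start_bp: int, end_bp: int, gene_length_bp: int, protein_length_aa: int) -> dict:
    """Cross-check CDS coordinates against the reported gene and protein lengths.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the protein length does not count the stop codon.
    """
    span = end_bp - start_bp + 1           # inclusive span in base pairs
    codons, remainder = divmod(span, 3)    # whole codons and leftover bases
    expected_protein = codons - 1          # drop one codon for the stop
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "expected_protein_aa": expected_protein,
        "protein_length_matches": expected_protein == protein_length_aa,
    }


# Values taken from the record: Start bp, End bp, Gene Length, Protein Length.
report = check_cds(3759, 5879, 2121, 706)
print(report)
```

Under these assumptions the four fields agree: 5879 - 3759 + 1 = 2121 bp, which is 707 codons, or 706 amino acids plus one stop codon.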


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAGTA AATGTAGTTC TCATAATAGT CTGTATGCAC TGATATTGCT TGCACAATAT 
CATAATATAA CTGTCAATGC TGAAACTATA AGGCATCAGT ATAATACCCA CACACAAGAT
TTTGGGGTGA CTGAATGGTT ACTGGCAGCG AAATCTATTG GCTTAAAAGC AAAATATGTA
GAAAAACATT TTTCCAGATT GTCAATAATT TCTTTACCTG CGTTGATATG GCGGGATGAC
GGTAAGCATT ATATATTGTC TCGTATTACT AAAGATTCAT CACGCTATCT TGTTTATGAT
CCAGAACAAC ATCAGTCACT AACTTTTAGT CGGGATGAGT TTGAAAAACT GTATCAGGGA
AAAGTCATTC TGGTTACGTC AAGAGCAACA GTAGTCGGAG AGTTAGCTAA ATTTGATTTT
TCTTGGTTTA TCCCCTCTGT TGTGAAATAC AGGAGGATTT TACTTGAGGT GTTAACTGTT
TCTGCTTTTA TTCAGTTTCT TGCGTTAATA ACACCTCTTT TTTTTCAGGT TGTAATGGAT
AAGGTTTTAG TTCACCGGGG GTTTTCAACG TTAAATATTA TCACAATAGC ATTTATTATA
GTGATACTTT TTGAAGTGAT ATTAACCGGA GCCAGAACTT ATATTTTCTC TCATACTACA
AGTCGTATTG ACGTCGAACT GGGTGCTAAG TTATTCAGAC ATTTGCTTGC ATTGCCTGTT
TCATATTTTG AAAATCGCAG GGTCGGAGAG ACCGTTGCCA GAGTAAGGGA ACTGGAGCAA
ATTCGTAATT TTTTAACCGG ACAAGCGTTG ACATCAGTTC TTGATCTATT TTTTTCTGTA
ATATTTTTTT GTGTCATGTG GTATTACAGC CCTCAATTAA CACTGGTTAT ATTATTGTCA
CTACCTTGTT ATGTTATATG GTCATTGTTT ATATCACCCT TATTACGTCG ACGTCTTGAT
GATAAGTTTC TCAGGAATGC AGAAAATCAA GCTTTTCTTG TCGAAACGGT AACAGCAATA
AATACAATCA AATCCATGGC AGTATCACCA CAGATGATTG CTACATGGGA TAAACAACTG
GCCGGGTATG TTGCTTCCAG TTTCAGGGTA AATCTGGTTG CAATGACAGG GCAGCAGGGG
ATACAGCTGA TACAGAAAAG TGTAATGGTA ATTAGTCTAT GGATGGGAGC ACATCTTGTC
ATATCGGGAG AGATAAGCAT CGGGCAGTTA ATCGCGTTTA ATATGCTTGC AGGTCAGGTC
ATTGCTCCTG TTATCAGACT GGCTCATCTT TGGCAGGATT TTCAGCAGGT TGGTATTTCT
GTTGAGCGCC TTGGTGATGT ATTAAATACA CCGGTAGAGA AAAAGTCAGG CAGAAATATA
CTGCCGGAAA TTCAGGGGGA TATCGAATTT AAAAATGTCA GATTTCGGTA TTCTTCTGAC
GGTAATGTTA TTTTGAATAA TATTAATTTA TACATATCAA AGGGGGATGT TATCGGTATA
GTTGGCCGTT CTGGTTCAGG AAAAAGTACA TTAACAAAAC TGCTTCAGCG CTTTTATATA
CCAGAGACCG GACAGATTTT AATTGATGGG CATGATTTAT CACTTGCAGA TCCAGAATGG
TTACGACGCC AGATTGGTGT TGTATTGCAG GAAAATATAC TATTAAATCG TAGTATTATC
GATAATATTA CATTAGCTTC TCCGGCTGTA TCTATGGAAC AGGCTATTGA GGCAGCCAGA
CTTGCAGGTG CCCATGATTT TATTAGAGAA CTAAAAGAAG GGTACAATAC TATTGTTGGA
GAACAGGGCG TTGGTCTTTC AGGAGGGCAG CGCCAACGGA TCGCAATAGC CCGGGCTCTT
GTTACAAATC CTCGAATTCT TATTTTTGAT GAAGCAACCA GTGCTCTTGA TTATGAGTCG
GAAAATATAA TAATGAAAAA TATGTCAAGA ATATGTAAGA ACAGAACCGT AATTATTATT
GCACACAGGT TGTCAACTGT AAAAAATGCA AACAGAATAA TTGTTATGGA TAATGGCTTT
ATTTCTGAAG ATGGCACACA TAAAGAGCTT ATCTCCAAAA AAGACAGTTT ATATGCATAT
TTATATCAGT TGCAGGCATA A
 
Protein sequence
MMSKCSSHNS LYALILLAQY HNITVNAETI RHQYNTHTQD FGVTEWLLAA KSIGLKAKYV 
EKHFSRLSII SLPALIWRDD GKHYILSRIT KDSSRYLVYD PEQHQSLTFS RDEFEKLYQG
KVILVTSRAT VVGELAKFDF SWFIPSVVKY RRILLEVLTV SAFIQFLALI TPLFFQVVMD
KVLVHRGFST LNIITIAFII VILFEVILTG ARTYIFSHTT SRIDVELGAK LFRHLLALPV
SYFENRRVGE TVARVRELEQ IRNFLTGQAL TSVLDLFFSV IFFCVMWYYS PQLTLVILLS
LPCYVIWSLF ISPLLRRRLD DKFLRNAENQ AFLVETVTAI NTIKSMAVSP QMIATWDKQL
AGYVASSFRV NLVAMTGQQG IQLIQKSVMV ISLWMGAHLV ISGEISIGQL IAFNMLAGQV
IAPVIRLAHL WQDFQQVGIS VERLGDVLNT PVEKKSGRNI LPEIQGDIEF KNVRFRYSSD
GNVILNNINL YISKGDVIGI VGRSGSGKST LTKLLQRFYI PETGQILIDG HDLSLADPEW
LRRQIGVVLQ ENILLNRSII DNITLASPAV SMEQAIEAAR LAGAHDFIRE LKEGYNTIVG
EQGVGLSGGQ RQRIAIARAL VTNPRILIFD EATSALDYES ENIIMKNMSR ICKNRTVIII
AHRLSTVKNA NRIIVMDNGF ISEDGTHKEL ISKKDSLYAY LYQLQA
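
The gene and protein sequences are printed in space-separated blocks of ten characters. As an illustration of how the two listings relate, here is a small self-contained sketch that strips the block spacing, computes a GC fraction, and translates DNA up to the first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The sketch uses the standard codon assignments, which translation table 11 shares for elongation; the alternative start codons that table 11 permits are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq_text)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq_text: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq_text)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the listing.
first_line = "ATGATGAGTA AATGTAGTTC TCATAATAGT CTGTATGCAC TGATATTGCT TGCACAATAT"
last_line = "TTATATCAGT TGCAGGCATA A"
print(translate(first_line))
print(translate(last_line))
```

Because every full line of the gene listing holds 60 bases, each line starts in frame, so single lines can be translated on their own. The first line yields the first 20 residues of the protein listing and the last line yields its final six residues before the TAA stop codon.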