Gene ECH74115_0584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0584 
Symbol 
ID6971144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp605092 
End bp607254 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content57% 
IMG OID643384626 
Producttype I secretion system ATPase family protein 
Protein accessionYP_002269140 
Protein GI209400615 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID[TIGR03375] type I secretion system ATPase, LssB family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA ATGCCCAGTC CGTGGAAGCC TGGCTGGAGG CGATGATCGC CGTGGCGCGC 
TATTATCGGC TCGATTTTTC TCAGGAGAAT GTGCGCGCGA CCGTCAACTG GGAGCGCGAC
AGTAAGCGGG AGGAGTTACT CACCGACATG GCGCGGCAGT TAGGGATGGG GTTGCGGTTG
GTGGAGTTTT CCGCCGATTC GCTTAACCCC TGGCGTCTGC CGCTGATCGC GGTGTTTGAT
AATCAGCAGA TTGGCGTGAT CACCCGTCGC GACAATCATG ACAATATCAG CGTTCAGTTC
AGCGGTGACG AAGGGCTGGA AACTACACTT AACGTCGCGG ATATTGAAGA TAAAATCGTC
GAACTGGCGC TGCTGCGCCC TCTCAGCGCC ATCCCTGACG CGCGGGTGGA TGACTATATT
CGCCCGTATC AGGCGAACTG GTTCTGGAGT CTTTCGCTAA AAGACTGGCG GCGCTACGGC
GATATTATGC TGGCGTCACT GGTGGCGAAC GTGCTGGCGC TGGCGGCGAT GATTTTCTCG
ATGCAGGTTT ACGATCGGGT GGTTCCGGCG CAATCGTACC CGACGCTATG GGTGCTTTTC
GCCGGGGTGA TGATGGCGAT CCTGTTTGAG TTCTGTATGC GCATGGTGCG TACGCATCTA
TCTGATGTGA TCGGAAAGCG GGCAGACCTG CGTATTTCGG ATCGCGTTTT TGGTCATGCG
CTACGGCTGA AAAACAACGT GCGATCGAAA TCCACCGGAT CGTTTATCTC GCAGATCCGC
GAACTGGAAT CAGTGCGGGA GCTTATTACG TCCACCACCA TCGGCGCGGT GGCGGACCTG
CCATTCTTCC TGCTGTTTGT CTTTATTTTG TGGATGATCG GCGGCTGGCT GGTGCTGGTG
GTTTTGCTGG CGCTGCTGTT GCTGGTGATC CCCGGCCTGC TGGTGCAACG CCCGCTGGCG
CGGCTGGCGA ACGAAGGAAT GCGCGAGTCA GCGGTACGCA ACGCTACGCT GGTGGAAGCC
GTGCAGTCGA TTGAAGATAT TAAACTGCTG CGCGCCGAAC AGCGTTTTCA GAATCAGTGG
AACCACACCA ACGACGTGGC CTCGTCCATC AGCATGAAGC AACGTTTTCT CACCGGGCTG
CTGCTCACCT GGACCCAGGA AGTGCAGTCC ATCGTCTACG TGGTGGTGCT GCTGGTGGGC
TGCTTTATGG TGATGAACGG CGACATGACC ACCGGCGCGC TGGTCGGGAC GACGATCCTG
GCTTCGCGCA CCATTGCGCC GCTGTCGCAG ATATCCGGCG TGCTCTCGCG CTGGCAGCAG
GCGAAAGTAG CACGTAACGG CCTTGATGAA CTGATGAAGC GCCCGGTGGA TCAGCCGGAA
CACGGCAAAC TGGTGCATAA AGCGGTGCTG CACGGTAATT ATCAGTTCAG CAACGCGGTG
TTTTATTACG ACGAAGAAGA GAAGATTGCC GATGTAGCGA TTGGCAAACT CAACATTCAG
GCTGGTGAGA AAATCGCCAT TCTCGGGCGC AACGGCGCGG GTAAATCGAC GCTACTGCAA
ATGCTCGCCG GAATGCGTAT CGCCCAGCAG GGGCAGGTGT TGCTGGATAA CATCAGTATC
GGGCAGCTTG ATCCGGCAGA TCTGCGGCGA GACATGGGGC TACTCAGCCA GACCGGGCGA
CTGTTCTTTG GTTCGCTGCG GGAAAACCTC ACCATGGGGA TGCCGGAAGC CAGCGATGAA
GATATCGAAC GCGCGTTAAC CCTCAGCGGA GCGCTGCCCT TTGTGCAGAA GCAGAAAAAC
GGCCTCAACT ACATGATTCA GGAAGGGGGA TTTGGCCTCT CTGGTGGTCA GCGGCAAACG
CTGCTGCTGG CGCGGCTACT CATCTCTCAG CCGAATATCG TCCTGCTCGA CGAACCCAGC
GCCTCACTGG ACGAAATGGC AGAAGCGTAT TTAATCGAAC AACTGAAACA GTGGATTGGG
CATCGCACGC TGATTATCGC CACTCACCGC ACGGCGATGC TGCAACTGGT GGATAGGATA
ATCGTGATGG ATCAGGGGCG GATTGTGATG GACGGCGCGA AAGAGGCCAT TTTGCGTGAA
CAGGGCGAGC CCACAGCGCG ACGGGTGGTG TTGCAGGAGA AGAATAAGGG GTCTGCGGCA
TGA
 
Protein sequence
MKKNAQSVEA WLEAMIAVAR YYRLDFSQEN VRATVNWERD SKREELLTDM ARQLGMGLRL 
VEFSADSLNP WRLPLIAVFD NQQIGVITRR DNHDNISVQF SGDEGLETTL NVADIEDKIV
ELALLRPLSA IPDARVDDYI RPYQANWFWS LSLKDWRRYG DIMLASLVAN VLALAAMIFS
MQVYDRVVPA QSYPTLWVLF AGVMMAILFE FCMRMVRTHL SDVIGKRADL RISDRVFGHA
LRLKNNVRSK STGSFISQIR ELESVRELIT STTIGAVADL PFFLLFVFIL WMIGGWLVLV
VLLALLLLVI PGLLVQRPLA RLANEGMRES AVRNATLVEA VQSIEDIKLL RAEQRFQNQW
NHTNDVASSI SMKQRFLTGL LLTWTQEVQS IVYVVVLLVG CFMVMNGDMT TGALVGTTIL
ASRTIAPLSQ ISGVLSRWQQ AKVARNGLDE LMKRPVDQPE HGKLVHKAVL HGNYQFSNAV
FYYDEEEKIA DVAIGKLNIQ AGEKIAILGR NGAGKSTLLQ MLAGMRIAQQ GQVLLDNISI
GQLDPADLRR DMGLLSQTGR LFFGSLRENL TMGMPEASDE DIERALTLSG ALPFVQKQKN
GLNYMIQEGG FGLSGGQRQT LLLARLLISQ PNIVLLDEPS ASLDEMAEAY LIEQLKQWIG
HRTLIIATHR TAMLQLVDRI IVMDQGRIVM DGAKEAILRE QGEPTARRVV LQEKNKGSAA