Gene ECH74115_B0018 details

Gene Information

Locus tag: ECH74115_B0018
Symbol:
ID: 6966441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011350
Strand:
Start bp: 713
End bp: 3709
Gene Length: 2997 bp
Protein Length: 998 aa
Translation table: 11
GC content: 39%
IMG OID: 643383925
Product: RTX C- domain protein
Protein accession: YP_002268404
Protein GI: 209395598
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
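
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic, which can serve as a quick consistency check on the record. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS, and a terminal stop codon that is counted in the gene length but not in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon (the stop) does not contribute an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(713, 3709)
print(gene_len, protein_len)  # 2997 998
```

Under these assumptions the computed values agree with the Gene Length (2997 bp) and Protein Length (998 aa) fields listed in the record.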


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGTAA ATAAAATAAA GAACATTTTC AATAATGCGA CATTGACTAC AAAATCAGCA 
TTTAATACAG CATCATCAAG CGTACGTTCC GCTGGAAAAA AACTCATATT ATTAATACCT
GATAATTATG AAGCTCAGGG CGTGGGTATT AATGAGTTGG TCAAAGCTGC TGATGAGCTT
GGAATAGAAA TACACCGTAC TGAACGAGAT GATACAGCGA TTGCAAACCA GTTTTTTGGT
GCAGCAGAAA AAGTTGTAGG ATTAACTGAA CGTGGTGTTG CAATATTCGC ACCACAACTT
GACAAACTTC TGCAGAAGTA TCAGAAAGTT GGGAGTAAAA TAGGAGGAAC CGCTGAAAAT
GTAGGTAATA ATCTGGGAAA AGCCGGAACA GTTCTCTCAG CACTACAGAA TTTTACGGGG
ATTGCTTTAT CAGGCATGGC TCTTGATGAA TTGCTGAGAA AACAACGGGC AGGAGAGGAT
ATAAGTCAGA ATGATATTGC CAAAAGTAGT ATTGAACTTA TTAATCAGCT TGTAGATACA
GTATCAAGTA TAAACAGTAC CGTTGATTCA TTTTCTGAGC AGCTTAACCA GCTTGGCTCA
TTTTTATCCA GTAAACCTCG ATTAAGTTCT GTTGGTGGGA AATTACAAAA TTTACCAGAC
CTGGGCCCCC TGGGGGATGG GCTGGATGTT GTCTCCGGAA TTCTTTCTGC TGTATCAGCA
AGCTTTATTC TGGGAAACAG TGACGCACAT ACAGGAACAA AAGCTGCAGC GGGTATCGAA
CTGACAACTC AGGTTCTTGG AAATGTTGGT AAAGCTGTTT CGCAATATAT TCTGGCTCAG
AGAATGGCAC AGGGGTTATC GACAACAGCT GCAAGTGCGG GTCTGATCAC ATCGGCTGTT
ATGCTGGCTA TCAGTCCTCT TTCTTTCCTG GCTGCTGCAG ATAAATTTGA GCGAGCTAAG
CAGCTTGAAT CATATTCTGA ACGATTTAAA AAATTGAATT ATGAAGGGGA TGCTTTACTC
GCAGCCTTTC ATAAAGAAAC CGGAGCTATA GATGCAGCCC TGACAACAAT AAATACTGTC
CTGAGTTCTG TATCTGCGGG AGTTAGTGCA GCCTCCAGTG CATCCCTCAT AGGGGCCCCG
ATAAGCATGC TGGTGAGTGC ATTAACCGGT ACGATATCTG GCATTCTGGA AGCATCAAAA
CAGGCTATGT TTGAGCACGT TGCAGAGAAA TTCGCTGCTC GGATCAATGA ATGGGAAAAG
GAGCATGGCA AAAATTATTT TGAGAATGGA TATGACGCAA GACATGCTGC GTTTTTAGAA
GACTCTCTGT CTTTGCTTGC TGATTTTTCT CGTCAGCATG CAGTAGAAAG AGCAGTCGCA
ATAACCCAGC AACATTGGGA TGAGAAGATC GGTGAACTTG CAGGCATAAC CCGTAATGCT
GATCGCAGTC AGAGTGGTAA GGCATATATT AATTATCTGG AAAATGGAGG GCTTTTAGAG
GCTCAACCGA AGGAGTTTAC ACAACAAGTG TTTGATCCTC AAAAAGGGAC CATAGACCTT
TCAACAGGTA ATGTATCAAG TGTTTTGACA TTTATAACAC CAACATTTAC CCCAGGAGAA
GAAGTTAGAG AAAGAAAACA GAGTGGTAAA TATGAATATA TGACATCTCT TATTGTAAAT
GGTAAGGATA CATGGTCTGT AAAAGGCATA AAAAATCATA AAGGTGTATA TGATTATTCA
AAATTGATTC AGTTTGTTGA AAAGAATAAC AAACACTATC AGGCGAGAAT AATTTCTGAG
CTCGGAGATA AAGACGATGT GGTTTATTCT GGAGCAGGCT CATCAGAAGT ATTTGCTGGT
GAAGGTTATG ATACCGTATC TTATAATAAG ACGGATGTTG GTAAACTAAC AATTGATGCA
ACAGGAGCAT CAAAACCTGG TGAGTATATA GTTTCAAAAA ATATGTATGG TGACGTGAAG
GTATTGCAGG AAGTCGTTAA GGAACAGGAG GTGTCAGTAG GGAAGCGAAC AGAGAAAATA
CAATATCGTG ATTTTGAATT CAGAACCGGT GGAATTCCTT ATGATGTAAT AGATAATCTT
CATTCTGTTG AAGAGCTCAT TGGCGGAAAA CATGATGATG AATTCAAAGG CGGTAAGTTT
AATGATATAT TCCATGGCGC AGATGGGAAC GATTATATCG AAGGTAATTA TGGTAATGAT
CGACTATACG GCGATGATGG GGATGATTAT ATATCCGGAG GACAGGGAGA CGACCAGTTA
TTTGGTGGTA GTGGAAACGA TAAATTGAGT GGAGGGGATG GTAATAATTA TCTGACAGGA
GGAAGCGGTA ATGATGAGCT TCAGGCACAC GGAGCTTATA ATATTCTGTC AGGTGGTACT
GGTGATGATA AACTTTATGG TGGTGGTGGT ATTGATCTTC TGGATGGAGG GGAAGGTAAT
GACTATCTGA ATGGTGGTTT TGGTAATGAT ATTTATGTTT ATGGGCAAAA CTATGGTCAT
CATACAATTG CAGATGAAGG AGGTAAAGGA GATCGTTTGC ACTTATCTGA TATTAGCTTT
GATGATATCG CATTTAAGAG AGTTGGAAAT GATCTTATCA TGAATAAAGC CATTAATGGT
GTACTTTCAT TTAATGAGTC AAATGATGTC AATGGGATAA CATTTAAAAA CTGGTTTGCG
AAAGATGCCT CAGGAGCAGA TAATCATCTT GTTGAGGTTA TAACAGATAA AGATGGTCGA
GAGATAAAAG TTGATAAGAT ACCTCATAAT AATAATGAAC GGTCAGGTTA TATAAAAGCC
AGTAATATAG CATCTGAAAA AAACATGGTT AATATCACCA GTGTTGCCAA TGATATTAAT
AAGATTATTT CTTCAGTTTC AGGGTTCGAT TCAGGTGATG AACGATTAGC ATCTTTATAT
AATTTATCCT TACATCAAAA CAACACACAC TCAACAACTT TAACGACAAC TGTCTGA
 
Protein sequence
MTVNKIKNIF NNATLTTKSA FNTASSSVRS AGKKLILLIP DNYEAQGVGI NELVKAADEL 
GIEIHRTERD DTAIANQFFG AAEKVVGLTE RGVAIFAPQL DKLLQKYQKV GSKIGGTAEN
VGNNLGKAGT VLSALQNFTG IALSGMALDE LLRKQRAGED ISQNDIAKSS IELINQLVDT
VSSINSTVDS FSEQLNQLGS FLSSKPRLSS VGGKLQNLPD LGPLGDGLDV VSGILSAVSA
SFILGNSDAH TGTKAAAGIE LTTQVLGNVG KAVSQYILAQ RMAQGLSTTA ASAGLITSAV
MLAISPLSFL AAADKFERAK QLESYSERFK KLNYEGDALL AAFHKETGAI DAALTTINTV
LSSVSAGVSA ASSASLIGAP ISMLVSALTG TISGILEASK QAMFEHVAEK FAARINEWEK
EHGKNYFENG YDARHAAFLE DSLSLLADFS RQHAVERAVA ITQQHWDEKI GELAGITRNA
DRSQSGKAYI NYLENGGLLE AQPKEFTQQV FDPQKGTIDL STGNVSSVLT FITPTFTPGE
EVRERKQSGK YEYMTSLIVN GKDTWSVKGI KNHKGVYDYS KLIQFVEKNN KHYQARIISE
LGDKDDVVYS GAGSSEVFAG EGYDTVSYNK TDVGKLTIDA TGASKPGEYI VSKNMYGDVK
VLQEVVKEQE VSVGKRTEKI QYRDFEFRTG GIPYDVIDNL HSVEELIGGK HDDEFKGGKF
NDIFHGADGN DYIEGNYGND RLYGDDGDDY ISGGQGDDQL FGGSGNDKLS GGDGNNYLTG
GSGNDELQAH GAYNILSGGT GDDKLYGGGG IDLLDGGEGN DYLNGGFGND IYVYGQNYGH
HTIADEGGKG DRLHLSDISF DDIAFKRVGN DLIMNKAING VLSFNESNDV NGITFKNWFA
KDASGADNHL VEVITDKDGR EIKVDKIPHN NNERSGYIKA SNIASEKNMV NITSVANDIN
KIISSVSGFD SGDERLASLY NLSLHQNNTH STTLTTTV
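
The gene and protein sequences above are laid out in space-separated blocks of 10 residues, so they need the whitespace stripped before use. The following is a minimal sketch of how the protein sequence could be reproduced from the gene sequence. The helper names `clean` and `translate` are hypothetical. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and that the gene is read in the orientation shown.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks used in the 10-residue block layout."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the sequence starts with ATG. An alternative start codon
    would need to be mapped to M explicitly, which this sketch does not do.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above (60 bases, 20 codons).
first_row = "ATGACAGTAA ATAAAATAAA GAACATTTTC AATAATGCGA CATTGACTAC AAAATCAGCA"
print(translate(clean(first_row)))  # MTVNKIKNIFNNATLTTKSA
```

The 20 residues produced from the first row match the first two blocks of the protein sequence (MTVNKIKNIF NNATLTTKSA). Passing the full gene sequence through the same two functions would be the way to check the remaining rows.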