Gene Information |
Locus tag | ECH74115_B0018 |
Symbol | |
ID | 6966441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 713 |
End bp | 3709 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643383925 |
Product | RTX C- domain protein |
Protein accession | YP_002268404 |
Protein GI | 209395598 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
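
The coordinate and length fields above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; both assumptions are inferred from the numbers in this record, not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record: Start bp 713, End bp 3709.
length = gene_length_bp(713, 3709)
residues = protein_length_aa(length)
print(length, residues)  # 2997 998
```

Under these assumptions the result agrees with the Gene Length (2997 bp) and Protein Length (998 aa) fields.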
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGTAA ATAAAATAAA GAACATTTTC AATAATGCGA CATTGACTAC AAAATCAGCA TTTAATACAG CATCATCAAG CGTACGTTCC GCTGGAAAAA AACTCATATT ATTAATACCT GATAATTATG AAGCTCAGGG CGTGGGTATT AATGAGTTGG TCAAAGCTGC TGATGAGCTT GGAATAGAAA TACACCGTAC TGAACGAGAT GATACAGCGA TTGCAAACCA GTTTTTTGGT GCAGCAGAAA AAGTTGTAGG ATTAACTGAA CGTGGTGTTG CAATATTCGC ACCACAACTT GACAAACTTC TGCAGAAGTA TCAGAAAGTT GGGAGTAAAA TAGGAGGAAC CGCTGAAAAT GTAGGTAATA ATCTGGGAAA AGCCGGAACA GTTCTCTCAG CACTACAGAA TTTTACGGGG ATTGCTTTAT CAGGCATGGC TCTTGATGAA TTGCTGAGAA AACAACGGGC AGGAGAGGAT ATAAGTCAGA ATGATATTGC CAAAAGTAGT ATTGAACTTA TTAATCAGCT TGTAGATACA GTATCAAGTA TAAACAGTAC CGTTGATTCA TTTTCTGAGC AGCTTAACCA GCTTGGCTCA TTTTTATCCA GTAAACCTCG ATTAAGTTCT GTTGGTGGGA AATTACAAAA TTTACCAGAC CTGGGCCCCC TGGGGGATGG GCTGGATGTT GTCTCCGGAA TTCTTTCTGC TGTATCAGCA AGCTTTATTC TGGGAAACAG TGACGCACAT ACAGGAACAA AAGCTGCAGC GGGTATCGAA CTGACAACTC AGGTTCTTGG AAATGTTGGT AAAGCTGTTT CGCAATATAT TCTGGCTCAG AGAATGGCAC AGGGGTTATC GACAACAGCT GCAAGTGCGG GTCTGATCAC ATCGGCTGTT ATGCTGGCTA TCAGTCCTCT TTCTTTCCTG GCTGCTGCAG ATAAATTTGA GCGAGCTAAG CAGCTTGAAT CATATTCTGA ACGATTTAAA AAATTGAATT ATGAAGGGGA TGCTTTACTC GCAGCCTTTC ATAAAGAAAC CGGAGCTATA GATGCAGCCC TGACAACAAT AAATACTGTC CTGAGTTCTG TATCTGCGGG AGTTAGTGCA GCCTCCAGTG CATCCCTCAT AGGGGCCCCG ATAAGCATGC TGGTGAGTGC ATTAACCGGT ACGATATCTG GCATTCTGGA AGCATCAAAA CAGGCTATGT TTGAGCACGT TGCAGAGAAA TTCGCTGCTC GGATCAATGA ATGGGAAAAG GAGCATGGCA AAAATTATTT TGAGAATGGA TATGACGCAA GACATGCTGC GTTTTTAGAA GACTCTCTGT CTTTGCTTGC TGATTTTTCT CGTCAGCATG CAGTAGAAAG AGCAGTCGCA ATAACCCAGC AACATTGGGA TGAGAAGATC GGTGAACTTG CAGGCATAAC CCGTAATGCT GATCGCAGTC AGAGTGGTAA GGCATATATT AATTATCTGG AAAATGGAGG GCTTTTAGAG GCTCAACCGA AGGAGTTTAC ACAACAAGTG TTTGATCCTC AAAAAGGGAC CATAGACCTT TCAACAGGTA ATGTATCAAG TGTTTTGACA TTTATAACAC CAACATTTAC CCCAGGAGAA GAAGTTAGAG AAAGAAAACA GAGTGGTAAA TATGAATATA TGACATCTCT TATTGTAAAT GGTAAGGATA CATGGTCTGT AAAAGGCATA AAAAATCATA AAGGTGTATA TGATTATTCA AAATTGATTC AGTTTGTTGA AAAGAATAAC AAACACTATC AGGCGAGAAT AATTTCTGAG 
CTCGGAGATA AAGACGATGT GGTTTATTCT GGAGCAGGCT CATCAGAAGT ATTTGCTGGT GAAGGTTATG ATACCGTATC TTATAATAAG ACGGATGTTG GTAAACTAAC AATTGATGCA ACAGGAGCAT CAAAACCTGG TGAGTATATA GTTTCAAAAA ATATGTATGG TGACGTGAAG GTATTGCAGG AAGTCGTTAA GGAACAGGAG GTGTCAGTAG GGAAGCGAAC AGAGAAAATA CAATATCGTG ATTTTGAATT CAGAACCGGT GGAATTCCTT ATGATGTAAT AGATAATCTT CATTCTGTTG AAGAGCTCAT TGGCGGAAAA CATGATGATG AATTCAAAGG CGGTAAGTTT AATGATATAT TCCATGGCGC AGATGGGAAC GATTATATCG AAGGTAATTA TGGTAATGAT CGACTATACG GCGATGATGG GGATGATTAT ATATCCGGAG GACAGGGAGA CGACCAGTTA TTTGGTGGTA GTGGAAACGA TAAATTGAGT GGAGGGGATG GTAATAATTA TCTGACAGGA GGAAGCGGTA ATGATGAGCT TCAGGCACAC GGAGCTTATA ATATTCTGTC AGGTGGTACT GGTGATGATA AACTTTATGG TGGTGGTGGT ATTGATCTTC TGGATGGAGG GGAAGGTAAT GACTATCTGA ATGGTGGTTT TGGTAATGAT ATTTATGTTT ATGGGCAAAA CTATGGTCAT CATACAATTG CAGATGAAGG AGGTAAAGGA GATCGTTTGC ACTTATCTGA TATTAGCTTT GATGATATCG CATTTAAGAG AGTTGGAAAT GATCTTATCA TGAATAAAGC CATTAATGGT GTACTTTCAT TTAATGAGTC AAATGATGTC AATGGGATAA CATTTAAAAA CTGGTTTGCG AAAGATGCCT CAGGAGCAGA TAATCATCTT GTTGAGGTTA TAACAGATAA AGATGGTCGA GAGATAAAAG TTGATAAGAT ACCTCATAAT AATAATGAAC GGTCAGGTTA TATAAAAGCC AGTAATATAG CATCTGAAAA AAACATGGTT AATATCACCA GTGTTGCCAA TGATATTAAT AAGATTATTT CTTCAGTTTC AGGGTTCGAT TCAGGTGATG AACGATTAGC ATCTTTATAT AATTTATCCT TACATCAAAA CAACACACAC TCAACAACTT TAACGACAAC TGTCTGA
|
Protein sequence | MTVNKIKNIF NNATLTTKSA FNTASSSVRS AGKKLILLIP DNYEAQGVGI NELVKAADEL GIEIHRTERD DTAIANQFFG AAEKVVGLTE RGVAIFAPQL DKLLQKYQKV GSKIGGTAEN VGNNLGKAGT VLSALQNFTG IALSGMALDE LLRKQRAGED ISQNDIAKSS IELINQLVDT VSSINSTVDS FSEQLNQLGS FLSSKPRLSS VGGKLQNLPD LGPLGDGLDV VSGILSAVSA SFILGNSDAH TGTKAAAGIE LTTQVLGNVG KAVSQYILAQ RMAQGLSTTA ASAGLITSAV MLAISPLSFL AAADKFERAK QLESYSERFK KLNYEGDALL AAFHKETGAI DAALTTINTV LSSVSAGVSA ASSASLIGAP ISMLVSALTG TISGILEASK QAMFEHVAEK FAARINEWEK EHGKNYFENG YDARHAAFLE DSLSLLADFS RQHAVERAVA ITQQHWDEKI GELAGITRNA DRSQSGKAYI NYLENGGLLE AQPKEFTQQV FDPQKGTIDL STGNVSSVLT FITPTFTPGE EVRERKQSGK YEYMTSLIVN GKDTWSVKGI KNHKGVYDYS KLIQFVEKNN KHYQARIISE LGDKDDVVYS GAGSSEVFAG EGYDTVSYNK TDVGKLTIDA TGASKPGEYI VSKNMYGDVK VLQEVVKEQE VSVGKRTEKI QYRDFEFRTG GIPYDVIDNL HSVEELIGGK HDDEFKGGKF NDIFHGADGN DYIEGNYGND RLYGDDGDDY ISGGQGDDQL FGGSGNDKLS GGDGNNYLTG GSGNDELQAH GAYNILSGGT GDDKLYGGGG IDLLDGGEGN DYLNGGFGND IYVYGQNYGH HTIADEGGKG DRLHLSDISF DDIAFKRVGN DLIMNKAING VLSFNESNDV NGITFKNWFA KDASGADNHL VEVITDKDGR EIKVDKIPHN NNERSGYIKA SNIASEKNMV NITSVANDIN KIISSVSGFD SGDERLASLY NLSLHQNNTH STTLTTTV
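
The relationship between the two sequence fields can be sketched as follows. The sketch strips the 10-character grouping spaces, translates codon by codon until the first stop, and computes GC content as a percentage. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in permitted start codons). The helper names clean, translate and gc_percent are hypothetical, and how the database rounds its GC content field is not known from this record.

```python
BASES = "TCAG"
# Amino acids in TCAG order for first, second and third codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence up to, not including, the first stop."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the Gene sequence field, as grouped in this record.
print(translate("ATGACAGTAA ATAAAATAAA G"))  # MTVNKIK
```

The printed residues match the start of the Protein sequence field. Passing the full Gene sequence to gc_percent would give a value to compare against the GC content field.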