Gene Information |
Locus tag | ECH74115_0683 |
Symbol | cstA |
ID | 6968750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 714639 |
End bp | 716744 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384719 |
Product | carbon starvation protein A |
Protein accession | YP_002269232 |
Protein GI | 209398506 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1966] Carbon starvation protein, predicted membrane protein |
TIGRFAM ID | |
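The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length agrees with that convention) and that the CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 714639
END_BP = 716744


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 2106, matches "Gene Length"
print(expected_protein_length(length_bp))  # 701, matches "Protein Length"
```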
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.720109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAT CAGGGAAATA CCTCGTCTGG ACAGTGCTCT CTGTAATGGG AGCATTTGCT CTGGGATATA TTGCTTTAAA TCGTGGGGAA CAGATCAACG CGCTGTGGAT TGTGGTGGCG TCGGTCTGTA TCTATCTGAT CGCTTACCGT TTTTATGGTC TGTATATCGC CAAAAATGTG CTGGCGGTTG ACCCGACGCG TATGACGCCA GCGGTGCGCC ATAACGACGG GCTGGACTAT GTGCCGACGG ACAAGAAAGT GCTGTTCGGT CACCATTTTG CGGCCATTGC CGGAGCAGGT CCGTTGGTTG GGCCGGTACT GGCGGCGCAA ATGGGTTACC TGCCAGGGAT GATCTGGCTA CTCGCCGGGG TGGTTCTCGC CGGTGCGGTG CAGGATTTCA TGGTGCTGTT TGTTTCTACG CGCCGTGACG GTCGTTCGCT GGGTGAGCTG GTCAAAGAAG AGATGGGGCC AACCGCCGGG GTGATCGCGC TGGTGGCCTG CTTTATGATC ATGGTCATTA TCCTTGCAGT ACTGGCGATG ATCGTGGTGA AAGCCCTGAC TCATAGCCCG TGGGGAACGT ATACCGTTGC GTTCACCATT CCGCTGGCGC TGTTCATGGG GATCTACCTG CGCTATCTGC GTCCGGGGCG TATTGGTGAA GTGTCGGTCA TCGGTCTGGT ATTCCTGATT TTCGCCATTA TTTCTGGCGG CTGGGTGGCA GAAAGTCCGA CCTGGGCACC GTACTTTGAC TTTACCGGCG TGCAGCTGAC CTGGATGCTG GTGGGTTACG GTTTTGTGGC GGCGGTGCTG CCGGTATGGT TGCTGCTGGC TCCGCGTGAC TACCTCTCTA CCTTCCTGAA AATCGGGACG ATCGTTGGTC TGGCGGTAGG CATTTTGATT ATGCGCCCGA CGCTGACCAT GCCTGCGTTG ACCAAATTTG TTGACGGCAC TGGCCCGGTA TGGACCGGTA ACCTGTTCCC GTTCCTGTTT ATCACCATCG CCTGTGGCGC GGTGTCTGGC TTCCATGCGC TGATCTCTTC CGGCACCACG CCGAAGATGC TGGCGAACGA AGGGCAGGCT TGTTTTATCG GCTACGGTGG AATGTTAATG GAATCCTTCG TGGCGATTAT GGCACTGGTT TCCGCCTGTA TCATCGATCC GGGCGTGTAC TTCGCCATGA ACAGCCCGAT GGCGGTGCTG GCTCCGGCAG GGACGGCGGA TGTGGTCGCT TCTGCCGCGC AAGTGGTGAG TAGCTGGGGC TTTGCGATTA CTCCGGATAC GCTCAACCAG ATTGCCAGCG AAGTGGGAGA ACAGTCGATC ATTTCCCGTG CGGGCGGTGC GCCGACGTTG GCGGTGGGGA TGGCCTACAT TCTGCATGGC GCGCTGGGCG GCATGATGGA TGTGGCGTTC TGGTATCACT TCGCCATTCT GTTTGAAGCT CTGTTTATTC TGACGGCGGT GGATGCAGGT ACGCGTGCTG CGCGCTTTAT GTTGCAGGAT CTGCTGGGCG TGGTTTCTCC GGGCCTGAAA CGGACCGATT CACTGCCTGC TAACCTGCTG GCAACGGCGC TGTGCGTGCT GGCGTGGGGC TACTTCCTGC ATCAGGGTGT GGTCGATCCG CTGGGCGGTA TTAACACTCT GTGGCCGCTG TTTGGTATTG CCAACCAGAT GCTGGCAGGG ATGGCGCTGA TGCTCTGTGC CGTGGTGTTG TTCAAGATGA AACGTCAACG TTACGCCTGG GTGGCGCTGG TACCAACGGC CTGGCTGCTG ATTTGTACCC TGACCGCAGG CTGGCAGAAA 
GCGTTTAGCC CGGATGCGAA AGTCGGCTTC CTGGCCATTG CTAATAAGTT CCAGGCAATG ATCGACAGCG GTAATATTCC ATCGCAGTAT ACTGAGTCAC AGCTGGCGCA ACTGGTGTTC AACAACCGTC TGGATGCCGG GTTAACCATC TTCTTTATGG TGGTCGTGGT GGTTCTGGCA CTGTTCTCGA TTAAGACGGC ACTTGCGGCA TTGAAAGAGC CGAAGCCAAC GGCGAAAGAA ACGCCGTATG AGCCAATGCC GGAAAATGTC GAGGAGATCG TGGTGCAGGC AAAAGGCGCG CACTAA
|
Protein sequence | MNKSGKYLVW TVLSVMGAFA LGYIALNRGE QINALWIVVA SVCIYLIAYR FYGLYIAKNV LAVDPTRMTP AVRHNDGLDY VPTDKKVLFG HHFAAIAGAG PLVGPVLAAQ MGYLPGMIWL LAGVVLAGAV QDFMVLFVST RRDGRSLGEL VKEEMGPTAG VIALVACFMI MVIILAVLAM IVVKALTHSP WGTYTVAFTI PLALFMGIYL RYLRPGRIGE VSVIGLVFLI FAIISGGWVA ESPTWAPYFD FTGVQLTWML VGYGFVAAVL PVWLLLAPRD YLSTFLKIGT IVGLAVGILI MRPTLTMPAL TKFVDGTGPV WTGNLFPFLF ITIACGAVSG FHALISSGTT PKMLANEGQA CFIGYGGMLM ESFVAIMALV SACIIDPGVY FAMNSPMAVL APAGTADVVA SAAQVVSSWG FAITPDTLNQ IASEVGEQSI ISRAGGAPTL AVGMAYILHG ALGGMMDVAF WYHFAILFEA LFILTAVDAG TRAARFMLQD LLGVVSPGLK RTDSLPANLL ATALCVLAWG YFLHQGVVDP LGGINTLWPL FGIANQMLAG MALMLCAVVL FKMKRQRYAW VALVPTAWLL ICTLTAGWQK AFSPDAKVGF LAIANKFQAM IDSGNIPSQY TESQLAQLVF NNRLDAGLTI FFMVVVVVLA LFSIKTALAA LKEPKPTAKE TPYEPMPENV EEIVVQAKGA H
|
| |
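The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 bases of the record. This is a minimal sketch: it hard-codes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons), and the helper name `translate` is hypothetical. Only the opening fragment of the gene is used here, not the full 2106 bp sequence.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
fragment = "ATGAACAAAT CAGGGAAATA CCTCGTCTGG"
print(translate(fragment))  # MNKSGKYLVW, the first 10 residues of the protein
```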