Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_B0045 |
Symbol | sopA |
ID | 6966381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 18792 |
End bp | 19949 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643383948 |
Product | plasmid-partitioning protein SopA |
Protein accession | YP_002268427 |
Protein GI | 209395633 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1192] ATPases involved in chromosome partitioning |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.257222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACAC TTAACCAGTG CATAAACGCT GGTCATGAAA TGACGAAGGC TATCGCCATT GCACAGTTTA ATGATGACAG CCCGGAAGCG AGGAAAATAA CCCGGCGCTG GAGAATAGGT GAAGCAGCGG ATTTAGTTGG GGTTTCTTCT CAGGCTATCA GAGATGCCGA GAAAGCAGGG CGACTACCGC ACCCGGATAT GGAAATTCGA GGACGGGTTG AGCAACGTGT TGGTTATACA ATTGAACAAA TTAATCATAT GCGTGATGTG TTTGGTACGC GATTGCGACG TGCTGAAGAC GTATTTCCAC CGGTGATCGG GGTTGCTGCC CATAAAGGTG GCGTTTACAA AACCTCAGTT TCTGTTCATC TTGCTCAGGA TCTGGCTCTG AAGGGGCTAC GTGTTTTGCT CGTGGAAGGT AACGACCCCC AGGGAACAGC CTCAATGTAT CACGGATGGG TACCAGATCT TCATATTCAT GCAGAAGACA CTCTCCTGCC TTTCTATCTT GGGGAAAAGG ACGATGTCAC TTATGCAATA AAGCCCACTT GCTGGCCGGG GCTTGACATT ATTCCTTCCT GTCTGGCTCT GCACCGTATT GAAACTGAGT TAATGGGCAA ATTTGATGAA GGTAAACTGC CCACCGATCC ACACCTGATG CTCCGACTGG CCATTGAAAC TGTTGCTCAT GACTATGATG TCATAGTTAT TGACAGCGCG CCTAACCTGG GTATCGGCAC GATTAATGTC GTATGTGCTG CTGATGTGCT GATTGTTCCC ACTCCTGCTG AGTTGTTTGA CTACACCTCC GCACTGCAGT TTTTCGATAT GCTTCGTGAT CTGCTCAAGA ACGTTGATCT TAAAGGGTTC GAGCCTGATG TACGTATTTT GCTTACCAAA TACAGCAATA GTAATGGCTC TCAGTCCCCG TGGATGGAGG AGCAAATTCG GGATGCCTGG GGAAGCATGG TTCTAAAAAA TGTTGTACGT GAAACGGATG AAGTTGGTAA AGGTCAGATC CGGATGAGAA CTGTTTTTGA ACAGGCCATT GATCAACGCT CTTCAACTGG TGCCTGGAGA AATGCTCTTT CTATTTGGGA ACCTGTCTGC AATGAAATTT TCGATCGTCT GATTAAACCA CGCTGGGAGA TTAGATAA
|
Protein sequence | METLNQCINA GHEMTKAIAI AQFNDDSPEA RKITRRWRIG EAADLVGVSS QAIRDAEKAG RLPHPDMEIR GRVEQRVGYT IEQINHMRDV FGTRLRRAED VFPPVIGVAA HKGGVYKTSV SVHLAQDLAL KGLRVLLVEG NDPQGTASMY HGWVPDLHIH AEDTLLPFYL GEKDDVTYAI KPTCWPGLDI IPSCLALHRI ETELMGKFDE GKLPTDPHLM LRLAIETVAH DYDVIVIDSA PNLGIGTINV VCAADVLIVP TPAELFDYTS ALQFFDMLRD LLKNVDLKGF EPDVRILLTK YSNSNGSQSP WMEEQIRDAW GSMVLKNVVR ETDEVGKGQI RMRTVFEQAI DQRSSTGAWR NALSIWEPVC NEIFDRLIKP RWEIR
|
| |