Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_A0072 |
Symbol | sopB |
ID | 6106535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010488 |
Strand | - |
Start bp | 53664 |
End bp | 54635 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641614819 |
Product | plasmid-partitioning protein |
Protein accession | YP_001739960 |
Protein GI | 170650790 |
COG category | [K] Transcription |
COG ID | [COG1475] Predicted transcriptional regulators |
TIGRFAM ID | [TIGR00180] ParB-like partition proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0000633113 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCGTG CGCCTGTCAT TCCAAAACAT ACGATTAAGA CTCAACCGCT TGAAGATACT CCGTCATCGG CACCAGCTGC ACCGATGGTG GATTCGTTAA TTGCGCGCGT AGGAGCTATG GCTCGTGGCA ATGCTATTAC TTTGCCTGTA TGTGGTAGGG ACGTGAAATT TACCCTTGAG GTGCTCCGGG GAGATAGTGT TGAGAAGACC TCTCGGGTAT GGTCAGGTAA TGAACGTGAC CAGGAACTGC TTACCGAAGA CGCCCTGGAT GATCTTATCC CTTCTTTTCT ACTGACAGGT CAGCAGACAC CAGCATTCGG TCGAAGAGTG TCTGGGGTCA TAGAAATTGC CGATGGGAGC CGCCGTCGTA AAGCTGCTGC ACTTACAGAA AGTGATTATC GAGTTCTGGT TGGTGAGCTG GATGATGAGC AGATGGCTGC ATTGTCCAGA TTGGGTAACG ATTATAGGCC AACAAGTGCT TATGAACGTG GTCAGCGTTA TGCAAGCCGA TTGCAGAATG AATTTGCTGG AAATATTTCT GCGCTGGCTG ATGCAGAAAA TATTTCACGT AAGATTATTA CCCGCTGTAT CAACACAGCT AAACTGCCTA AATCAGTTGT TGCTCTTTTT TCTCATCCCG GTGAACTATC TGCCCGGTCA GGTGATGCAC TTCAAAGAGC TTTTACAGAT AAAGAAGAAT TACTTAAGCA GCAGGCATCT AACCTTCACG AGCAGAAAAA AGCAGGAGTG ATATTTGAAG CTGAAGAAGT CATCACTCTT TTAACGTCTG TGCTTAAAAC GTCATCTGCC TCAAGAACTA GCTTAAGCTC ACGACATCAG TTTGCTCCTG GAGCGACAGT GTTGTACAAG GGCGATAAAA TGGTGTTAAA CCTAGATAGA TCTCGTATTC CAACTGAGTG TATAGAGAGA ATTGAAGCCA TTCTTAAGGA ACTTGAAAAA GCGGCACTTT GA
|
Protein sequence | MKRAPVIPKH TIKTQPLEDT PSSAPAAPMV DSLIARVGAM ARGNAITLPV CGRDVKFTLE VLRGDSVEKT SRVWSGNERD QELLTEDALD DLIPSFLLTG QQTPAFGRRV SGVIEIADGS RRRKAAALTE SDYRVLVGEL DDEQMAALSR LGNDYRPTSA YERGQRYASR LQNEFAGNIS ALADAENISR KIITRCINTA KLPKSVVALF SHPGELSARS GDALQRAFTD KEELLKQQAS NLHEQKKAGV IFEAEEVITL LTSVLKTSSA SRTSLSSRHQ FAPGATVLYK GDKMVLNLDR SRIPTECIER IEAILKELEK AAL
|
| |