Gene Information |
Locus tag | RSP_0090 |
Symbol | smoC |
ID | 3719921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 1800089 |
End bp | 1801042 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640071293 |
Product | operon regulator SmoC |
Protein accession | YP_353166 |
Protein GI | 77463662 |
COG category | [K] Transcription |
COG ID | [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain |
TIGRFAM ID | |
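The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1800089
END_BP = 1801042

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1

length = gene_length(START_BP, END_BP)
print(length)                 # 954, matching "Gene Length"
print(coding_codons(length))  # 317, matching "Protein Length"
```

Under those assumptions, the listed gene length (954 bp) and protein length (317 aa) agree with the listed coordinates.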
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0568695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACC GGCCCGAGAG CGAGCCGACG CCCCTCGACG ACGCCGCACG CGCGGGCTGG CTCTATTACG TCGCAGGCCT GACGCAAGAC CAGATCGCGC GGGAACTCGG CACCTCGCGC CAGAGGGCGC AGCGACTGGT GAGCCGGGCC ATCTCCGAAC GGCTGATCCA TGTCCGGCTC GAGCATCGGG TCTCGGGCTG CCTGCATCTG GAAGCCGCTC TGATCCGACG CTTCGGGCTG AAGCTGGCCC GCGTGGCGCC GAGCCTCGGG TCCGAGGTGG ATCCCCTGCC CTCCATCGCC CCCACCGCCG CCGCCGAGGT GGAGCGGGTG CTGCGCTCGG AGCGGCCGAT GGTGGTGGCC TTCGGCACCG GCCGGTCGCT GCGCGCCACC GTCGAGGAGA TGACCTCGAT GGTCTGCGAG CAGCACAAGA TCGTGTCGCT CAACGGGAAT ATCTCGGCGG ATGGCTCGGC CTCCTACTAC GATGTGATCT TCCGCATCGC CGACCGTGTG CGCGCGCCGC ACTATCCGAT GCCGATGCCC GTCATCGCGC AGGATGCGGC GGAGCGGGAG CTGTTTCATG CGCTGAAGCC CGTGCAGTCG GTGCTGCGGC TTGCCCGCAA TGCCGATGTG ACCTTCGTCG GGCTGGGACA GATGGGCGAG GACGCGCCGC TCCTGAAGGA CGGGTTCATC ACGCCCGACG AACTGGCCGA GATGCAGGAG CTGGGCGCGG TCGGAGAGGT GGCGGGATGG GTCTTCGACT CGGAGGGCCG CTACCTCGAA ACCAGCATCA ATCAGCGAGT TGCGGGCGTC CGTGTCGAAC TTTCCGAGGA TCGGACAGTG GTCGCCATCG CAGGCGGCAG ACGCAAGCTC GCGGCGCTGC ACGCAGGCCT AAGGGGCCGT CTTTTCAACG GCCTGATCAC CGACGAGCTC ACGGCGCAGG CACTTCTGTC CTGA
|
Protein sequence | MIDRPESEPT PLDDAARAGW LYYVAGLTQD QIARELGTSR QRAQRLVSRA ISERLIHVRL EHRVSGCLHL EAALIRRFGL KLARVAPSLG SEVDPLPSIA PTAAAEVERV LRSERPMVVA FGTGRSLRAT VEEMTSMVCE QHKIVSLNGN ISADGSASYY DVIFRIADRV RAPHYPMPMP VIAQDAAERE LFHALKPVQS VLRLARNADV TFVGLGQMGE DAPLLKDGFI TPDELAEMQE LGAVGEVAGW VFDSEGRYLE TSINQRVAGV RVELSEDRTV VAIAGGRRKL AALHAGLRGR LFNGLITDEL TAQALLS
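As a rough illustration of how the sequence fields relate to each other, the sketch below strips the display spacing from a nucleotide string, computes its GC fraction, and translates it codon by codon. It assumes the spaces between 10-character groups are formatting only, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names are hypothetical, and only the first 30 bases of the gene are used here.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and upper-case the bases."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence listed above.
prefix = "ATGATCGACC GGCCCGAGAG CGAGCCGACG"
print(translate(prefix))    # MIDRPESEPT
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow a comparison with the listed GC content and protein sequence.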