Gene Information |
Locus tag | P9303_07291 |
Symbol | |
ID | 4777149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 673670 |
End bp | 675004 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640086238 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_001016745 |
Protein GI | 124022438 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family; [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCCTG CAGCCACCCA AGCAGCAGCT GCAAAGGCCA AACCAAAAGC CTCTGCTAAG ACTGCCAAAC CCCAAGTGGC CAAAGGCACG GTTAAAGCCA AAGCCAAGAA AGTCAAGGCA ACAGCTCCTA AGGCCAAGAC TGCTGCCAAG CCGGTTAGTT CTGCTGCAAA AGCGTCCAGC CCCAAAGCAA AAAAAGCAAA AGCCACCCCT ACCCCTACTG TCAGCAAAAA CCTTGATTTA ACGGCTGACA AACTTCTTGC TACAGCCGCG ACAAGCGCCC CTAAGGCCAG CGCTGAAACT GAAAGCAGCC AAGCTGCTGC TAAAGCAAGC AGCGAAGCAG ACGCGAAGGC CAGAGCTCTA GCCAATATCA AGATCGGTCC TAAAGGCGTC TACACCGAAG ATTCCATCAG GGTTTATCTC CAGGAGATCG GCCGTATCCG CCTGCTCCGT CCAGACGAGG AGATTGAACT AGCCCGCAAA ATTGCTGACC TCCTGCATCT AGAGGAACTG GCAGAGCAGT TCAATAGTGA TCACGGTCAC TACCCCAACA ACAAGGAATG GGCTGCCCTT GTTGAAATGC CCAACATCAA ATTCCGCCGT CGCCTGATGC TGGGGCGCCG AGCCAAGGAA AAGATGGTGC AATCCAACCT GCGGCTGGTT GTTTCGATCG CCAAAAAATA CATGAATCGA GGCCTGTCCT TCCAGGATCT AATCCAAGAA GGCAGCCTGG GATTAATTCG TGCTGCCGAA AAATTCGACC ACGAGAAAGG GTACAAATTC TCCACCTATG CCACCTGGTG GATCCGTCAA GCCATCACCC GAGCAATTGC CGATCAGAGC CGCACAATCC GTCTTCCTGT TCACCTCTAC GAAACAATCT CACGCATCAA AAAAACAACC AAAGTGCTGA GCCAGGAGTT TGGCCGTAAA CCAACAGAAG AGGAAATTGC CGAAAGCATG GAGATGACAA TCGAAAAACT TCGATTTATC GCAAAAAGTG CTCAACTGCC CATCTCCCTT GAGACTCCGA TTGGCAAAGA GGAAGATTCC AGACTTGGTG ATTTCATCGA ATCTGATACT GAGAATCCAG AACAAGATGT CGCCAAAAAT CTCTTGCGAG AAGATCTAGA AGGTGTTCTC GCAACCCTCA GCCCCCGCGA ACGTGATGTT CTGCGTCTGC GCTATGGCCT TGACGATGGC CGCATGAAAA CCCTCGAGGA AATTGGACAG ATCTTTGAGG TCACACGAGA GCGCATCCGT CAGATTGAAG CCAAAGCCCT GCGCAAACTT CGCCACCCGA ATCGCAATGG TGTCTTAAAG GAATACATCA AGTAA
|
Protein sequence | MSPAATQAAA AKAKPKASAK TAKPQVAKGT VKAKAKKVKA TAPKAKTAAK PVSSAAKASS PKAKKAKATP TPTVSKNLDL TADKLLATAA TSAPKASAET ESSQAAAKAS SEADAKARAL ANIKIGPKGV YTEDSIRVYL QEIGRIRLLR PDEEIELARK IADLLHLEEL AEQFNSDHGH YPNNKEWAAL VEMPNIKFRR RLMLGRRAKE KMVQSNLRLV VSIAKKYMNR GLSFQDLIQE GSLGLIRAAE KFDHEKGYKF STYATWWIRQ AITRAIADQS RTIRLPVHLY ETISRIKKTT KVLSQEFGRK PTEEEIAESM EMTIEKLRFI AKSAQLPISL ETPIGKEEDS RLGDFIESDT ENPEQDVAKN LLREDLEGVL ATLSPRERDV LRLRYGLDDG RMKTLEEIGQ IFEVTRERIR QIEAKALRKL RHPNRNGVLK EYIK
|
| |
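The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the reported 1335 bp) and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(673670, 675004)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

Passing the "Gene sequence" field to `gc_percent` gives a value to compare with the reported GC content; the space-separated 10-base blocks need no preprocessing because whitespace is stripped first.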