Gene Information |
Locus tag | P9303_29951 |
Symbol | |
ID | 4776731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2646199 |
End bp | 2647233 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640088519 |
Product | Type II alternative RNA polymerase sigma factor, sigma-70 family protein |
Protein accession | YP_001018990 |
Protein GI | 124024683 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family; [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
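The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but it is consistent with the reported gene length) and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2646199
end_bp = 2647233
gene_length_bp = 1035
protein_length_aa = 344

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
computed_length = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, plus one stop codon
# that does not contribute a residue to the protein.
computed_protein_length = computed_length // 3 - 1

print(computed_length)          # 1035
print(computed_protein_length)  # 344
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.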
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.982291 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGATCC CTCTGGAATC TGCGAAGGGT GCTTCACTCA CGCCTTCGTC AGAAGTTGTA TTACCGTCTA CATCTAAGCA ACCTTCAGAA AAAGCGAATC GAGCTGGCCG TAACGGGCAG ACATCCCGTA ACCAAAATCG CCAAGGTGGT CGTTTGGGTA CTGATGCGAT CGGGTTCTAC CTGAGCAGCA TCGGACGCGT CCCTTTGCTG ACCGCAGCTG AGGAAATTGA GCTGGCACAC CATGTGCAAG CGATGAAGGA ATTGCTGGAG TTACCAGAGC AAGACCACAC TCCACGACAA CGCCACAAAA TTCGCATGGG CAAACGCGCC CGTGACCGCA TGATGTCAGC CAACCTCCGG CTCGTGGTTA GCGTTGCCAA AAAATATCAG AATCAGGGCC TTGAACTCCT TGACCTAGTC CAAGAAGGAG CCATTGGTCT CGAACGTGCT GTCGACAAGT TCGATCCAGC CATGGGTTAT AAGTTCTCCA CCTACGCCTA TTGGTGGATT CGTCAAGGGA TGACCAGGGC CATCGACAAC AGCGCTCGCA CCATCCGTCT ACCCATTCAC ATCAGCGAAA AACTCTCCAA GATGCGACGC ATCTCAAGAG AGCTTTCCCA TCGTTTCGGC CGTCAACCAA ATCGATTGGA GTTAGCCCAT GCCATGGGCA TTCAACCCCA AGACCTTGAG GATCTCATCG CTCAAAGCGC TCCTTGCGCA TCTCTCGATG CCCATGCCCG CGGAGAAGAA GACCGCAGCA CCCTAGGTGA ACTGATACCC GACCCCAATG GGGCCGAACC AATGGAAGGC CTAGATCGCA GCATCCAAAA GGAACACCTA GGAGGTTGGC TATCTCAGCT CAATGAACGT GAACAGAAAA TCCTGCGCTT GCGCTTCGGT CTAGATGGTG AAGAACCACT GACCCTCGCT GAAATTGGTC GGCAAATCAG CGTCTCACGA GAACGCGTAC GACAGTTGGA GGCCAAAGCC ATTCTCAAGC TACGGATGAT GACCAACCAT CAACAAGCTG CATGA
|
Protein sequence | MGIPLESAKG ASLTPSSEVV LPSTSKQPSE KANRAGRNGQ TSRNQNRQGG RLGTDAIGFY LSSIGRVPLL TAAEEIELAH HVQAMKELLE LPEQDHTPRQ RHKIRMGKRA RDRMMSANLR LVVSVAKKYQ NQGLELLDLV QEGAIGLERA VDKFDPAMGY KFSTYAYWWI RQGMTRAIDN SARTIRLPIH ISEKLSKMRR ISRELSHRFG RQPNRLELAH AMGIQPQDLE DLIAQSAPCA SLDAHARGEE DRSTLGELIP DPNGAEPMEG LDRSIQKEHL GGWLSQLNER EQKILRLRFG LDGEEPLTLA EIGRQISVSR ERVRQLEAKA ILKLRMMTNH QQAA
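The relationship between the two sequences above can be spot-checked by translating the start and end of the gene sequence. The sketch below is a hedged illustration rather than the method used to produce this record: it builds the codon table by hand, relying on the fact that translation table 11 assigns amino acids to codons the same way the standard code does (the tables differ only in permitted start codons). The helper names `clean` and `translate` are hypothetical. The tail fragment is assumed to be in frame because it is the last 15 bases of a gene whose length is a multiple of 3.

```python
# Codon table in NCBI ordering (bases T, C, A, G at each position).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(sequence_text):
    """Remove the spaces that split the displayed sequence into 10-letter blocks."""
    return "".join(sequence_text.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 bases of the gene sequence, copied as displayed.
gene_head = "ATGGGGATCC CTCTGGAATC TGCGAAGGGT"
gene_tail = "CAACAAGCTG CATGA"

print(translate(gene_head))  # MGIPLESAKG
print(translate(gene_tail))  # QQAA*
```

The first ten residues match the start of the protein sequence, and the tail translates to the final four residues followed by a stop codon, which is not written in the protein sequence.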