Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_17431 |
Symbol | rpoA |
ID | 4718475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1485964 |
End bp | 1486986 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640079471 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001010133 |
Protein GI | 123969275 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTAG AAGACCTAAA CGCAGACGCG TTTAGGTTTG AACCATTCTC AACCCCAAAA AAACTCTTAA CCTCATTATT TTCCGTGTTG CAATACCAGA TTGACAGAAT CGACCATCAA ATAGCAGATG ATCGCTCCCA AACAGGAACT TTTTTAATTG GTCCTCTTGA AAGGGGGCAA GCTACAACTT TGGGTAATTC TCTTAGAAGA GTCCTTATGG GAGGACTTGA AGGGAGTGCA GTTACAGCAG TAAGAATAGC AGGAATTAAT CATGAATATG CCACTATCCC TGGAGTTAGA GAAGACGTTT TAGATATTCT TCTGAATTGC AAGCAACTAT CAATAAATAG TTCTAATCCA GAGCTCGAAA TCGGCAGATT AGTGGCGAGC GGTCCAATGG AGGTGAAGGC GAATGATATT CAATTCTCCT CTCAAGTTGA AATTGTTGAT GGCGAAAAAC CGATCGCAAC AATTCAGGAG GGGCATAACT TAGAGTTGGA AATCCATGTT GAAAGGGGTG TTGGATATAG ACCAGTCGAC CGTAAGAGTG AAGAGACAAC TGCTATTGAT TTACTTCAAA TAGATGCAGT ATTTATGCCA GTGAAGAGGG TAAATTTTAC GATTGATGAA ACTGCTGTAG CAGAGGGCGC AACAGGAAGA GAAAGATTAA AAATGGAAGT AGTTACAGAT GGCTCAACAA GTCCTGACGA TGCTATTGCT GAAGCTGCAA ATCAGTTAAT AGAACTCTTT CAACCTCTTG CTACTGTCAC AATGGTTGAG GAAATTCCTG AAGAACCCGA ACCATCTCCT GAAGCTCAAA TCCCCCTTGA GGAACTAAAC TTGTCCGTTA GAGCATATAA TTGTTTGAAA AGGGCGCAAG TTAACTCAGT TTCTGATTTA ATGGGCTTCA GCTATGAAGA TCTTCTAGAA ATTAAGAACT TTGGCTCTAA ATCTGCAGAT GAGGTTATTG AGGCTCTTGA GCGCATCGGC ATTTCTATTC CACAAAGCAG AACATCTGTT TAA
|
Protein sequence | MDVEDLNADA FRFEPFSTPK KLLTSLFSVL QYQIDRIDHQ IADDRSQTGT FLIGPLERGQ ATTLGNSLRR VLMGGLEGSA VTAVRIAGIN HEYATIPGVR EDVLDILLNC KQLSINSSNP ELEIGRLVAS GPMEVKANDI QFSSQVEIVD GEKPIATIQE GHNLELEIHV ERGVGYRPVD RKSEETTAID LLQIDAVFMP VKRVNFTIDE TAVAEGATGR ERLKMEVVTD GSTSPDDAIA EAANQLIELF QPLATVTMVE EIPEEPEPSP EAQIPLEELN LSVRAYNCLK RAQVNSVSDL MGFSYEDLLE IKNFGSKSAD EVIEALERIG ISIPQSRTSV
|
| |