Gene A9601_17431 details


Gene Information

Locus tag: A9601_17431
Symbol: rpoA
ID: 4718475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1485964
End bp: 1486986
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 41%
IMG OID: 640079471
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_001010133
Protein GI: 123969275
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
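
The start, end, gene length, and protein length fields above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(1485964, 1486986)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch gives 1023 bp and 340 aa, matching the Gene Length and Protein Length fields.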


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGTAG AAGACCTAAA CGCAGACGCG TTTAGGTTTG AACCATTCTC AACCCCAAAA 
AAACTCTTAA CCTCATTATT TTCCGTGTTG CAATACCAGA TTGACAGAAT CGACCATCAA
ATAGCAGATG ATCGCTCCCA AACAGGAACT TTTTTAATTG GTCCTCTTGA AAGGGGGCAA
GCTACAACTT TGGGTAATTC TCTTAGAAGA GTCCTTATGG GAGGACTTGA AGGGAGTGCA
GTTACAGCAG TAAGAATAGC AGGAATTAAT CATGAATATG CCACTATCCC TGGAGTTAGA
GAAGACGTTT TAGATATTCT TCTGAATTGC AAGCAACTAT CAATAAATAG TTCTAATCCA
GAGCTCGAAA TCGGCAGATT AGTGGCGAGC GGTCCAATGG AGGTGAAGGC GAATGATATT
CAATTCTCCT CTCAAGTTGA AATTGTTGAT GGCGAAAAAC CGATCGCAAC AATTCAGGAG
GGGCATAACT TAGAGTTGGA AATCCATGTT GAAAGGGGTG TTGGATATAG ACCAGTCGAC
CGTAAGAGTG AAGAGACAAC TGCTATTGAT TTACTTCAAA TAGATGCAGT ATTTATGCCA
GTGAAGAGGG TAAATTTTAC GATTGATGAA ACTGCTGTAG CAGAGGGCGC AACAGGAAGA
GAAAGATTAA AAATGGAAGT AGTTACAGAT GGCTCAACAA GTCCTGACGA TGCTATTGCT
GAAGCTGCAA ATCAGTTAAT AGAACTCTTT CAACCTCTTG CTACTGTCAC AATGGTTGAG
GAAATTCCTG AAGAACCCGA ACCATCTCCT GAAGCTCAAA TCCCCCTTGA GGAACTAAAC
TTGTCCGTTA GAGCATATAA TTGTTTGAAA AGGGCGCAAG TTAACTCAGT TTCTGATTTA
ATGGGCTTCA GCTATGAAGA TCTTCTAGAA ATTAAGAACT TTGGCTCTAA ATCTGCAGAT
GAGGTTATTG AGGCTCTTGA GCGCATCGGC ATTTCTATTC CACAAAGCAG AACATCTGTT
TAA
 
Protein sequence
MDVEDLNADA FRFEPFSTPK KLLTSLFSVL QYQIDRIDHQ IADDRSQTGT FLIGPLERGQ 
ATTLGNSLRR VLMGGLEGSA VTAVRIAGIN HEYATIPGVR EDVLDILLNC KQLSINSSNP
ELEIGRLVAS GPMEVKANDI QFSSQVEIVD GEKPIATIQE GHNLELEIHV ERGVGYRPVD
RKSEETTAID LLQIDAVFMP VKRVNFTIDE TAVAEGATGR ERLKMEVVTD GSTSPDDAIA
EAANQLIELF QPLATVTMVE EIPEEPEPSP EAQIPLEELN LSVRAYNCLK RAQVNSVSDL
MGFSYEDLLE IKNFGSKSAD EVIEALERIG ISIPQSRTSV
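
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first 60 bases only. It is an assumption-laden illustration rather than the database's own method: it uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_block = "ATGGATGTAG AAGACCTAAA CGCAGACGCG TTTAGGTTTG AACCATTCTC AACCCCAAAA"
print(translate(first_block))
```

For this first block the result is MDVEDLNADAFRFEPFSTPK, which matches the first 20 residues of the protein sequence listed above.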