Gene OSTLU_119616 details

Gene Information

Locus tag: OSTLU_119616
Symbol: Rpb3
ID: 5000477
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 650476
End bp: 651484
Gene Length: 1009 bp
Protein Length: 318 aa
Translation table:
GC content: 46%
IMG OID: 640415898
Product: DNA-directed RNA polymerase II subunit 3
Protein accession: XP_001416207
Protein GI: 145342475
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID:
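
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported span runs from the start codon through the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 650476
end_bp = 651484
protein_length_aa = 318

# Inclusive coordinates: length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Coding length: one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Remainder of the span, attributable to intron sequence under the
# assumption that the span starts at the start codon and ends at the stop.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)  # 1009 957 52
```

The 52 bp difference is consistent with the record marking the gene as spliced: the genomic span is longer than the coding sequence alone.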


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.67705
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCCGG GGGAAGAAAC CGGTGCTTCA GTGACAGTGG CACGTTTAGA GGACGAAACT 
ATCATTTTCG ATTTGAAAGG AGTTGATGTC AGCCTAGCAA ACGCACTCAG GCGGCTAATG
ATAGCGGATG TCCCAACTGT TTCAATTGAC TTAGTCGAGG TGATAGAAAA CTCCTCTGTT
CTGTGTGATG AGTTCTTGGC ACACCGCTTG GGTCTGATCC CCCTTGACAG CACAAAAGCT
TCAGAACTCG TGAAGCCTTA CGAGTACACT GGAGACGATG ATACCGCAAC AGATGTGCAC
TTGGAACTCA ATGTGCGATG TCAGAGCGAC CAGACAAGGG ACGTCACGAG CGACGATCTG
ATCTCACACG ACGAAAGAGT AAAACCAGTG AGCTTTGGGG GGACAGGTGG TGGTTCTGCG
AAGTCAGGCG GGATTCTGAT AGCAAAACTA CGCAAAAACC AGCAGTTATC GTTGAAATGT
ATCGCAAGAA AAGGCACTGG TAAGGATCAT GCCAAGTGGT CCCCAGTCGC TACGGCCGTG
TTTAAGTACA CTCCCTTGAT TGACATCAAT CACGGCCTCC TGAACTCGCT AAATGGTAAG
AGCCAGACTA GAATTCTTTA CCACAACTTT AAACGTCACA ATGCAGGACC GGAAAAGGCA
GCGATCGTGG AGAGCGATCC ATCCAAAATG TTTAAATATG ACGCCGACAC GGATACTTTT
ACTCTCACCT CTCCAGAGTC ATGTACTTAC GATGGTGAAG TTATGAAGAA GGTAAGATTC
AACATTCGTT TCATCTATAT AAATGAACTC TTGACCTTGA CAGGTAAACG AGCTCGGAAA
GCCTGGATTG ATTGATGTGC GGCCCGGTCT GGACTGTTTT ACTTTCATCG TTGAGTCAAC
TGGAGTATTG AAAGTTGAGG AGGTTGTTCT ACAGGCAGTG CATATTTTAC AAAGTAAACT
GGATACTATA GGAGTAAGTT CGTGTTTTGA CAAATTAGAA ATTCAATAA
 
Protein sequence
MIPGEETGAS VTVARLEDET IIFDLKGVDV SLANALRRLM IADVPTVSID LVEVIENSSV 
LCDEFLAHRL GLIPLDSTKA SELVKPYEYT GDDDTATDVH LELNVRCQSD QTRDVTSDDL
ISHDERVKPV SFGGTGGGSA KSGGILIAKL RKNQQLSLKC IARKGTGKDH AKWSPVATAV
FKYTPLIDIN HGLLNSLNGK SQTRILYHNF KRHNAGPEKA AIVESDPSKM FKYDADTDTF
TLTSPESCTY DGEVMKKVNE LGKPGLIDVR PGLDCFTFIV ESTGVLKVEE VVLQAVHILQ
SKLDTIGVSS CFDKLEIQ
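
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below shows one way to strip the block formatting and compute a GC percentage comparable to the GC content field; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence rather than the full 1009 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from the record, used as a small example.
first_line = "ATGATTCCGG GGGAAGAAAC CGGTGCTTCA GTGACAGTGG CACGTTTAGA GGACGAAACT"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_percent(seq), 2))
```

Passing the whole gene sequence block to `clean_sequence` and then to `gc_percent` would give a value to compare against the reported 46%.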