Gene OSTLU_119541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119541 
SymbolPrp45 
ID5000182 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp514099 
End bp515210 
Gene Length1112 bp 
Protein Length285 aa 
Translation table 
GC content47% 
IMG OID640415603 
ProductPre-mRNA splicing factor prp45 
Protein accessionXP_001416169 
Protein GI145342249 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.126832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGTTC GAATGTTGCA ATACGGCGAC GAGCGAGCCT CGCGTGTTGA CGACGTCGGC 
GGCGACGGTG GCGCTTTTCC TGAACGACGG CATATCGAAA CCGGGGGCTC CCCAGCGTCG
TTGGCTCAAG ATTCGACACG TATAAGCCCG CATGGTGCCG TGTCGACATC GATCATAGAT
ACAAAGGAAA TATCCGTCGT TCAAAGTGAT TTGCGGCTCG CTCGGTCTCA GGTAAGTTTT
TTGTTGCTGT GAGCTTGTGT CTTCGACTGA GTCTCGTCTC TATCTAGATA ATCACACCAG
AGACCAGGGC AAAACAACAA AAACAGCTCC AAGATGCGAT CGACGGCAAG CTATACCCTC
AGAAACATCT TACTGATCTG AAGGGAGGGA ACACCCACGC CCAATACATC AAATATGCGC
CGGTGGGTGC TCGAGAGCGA GTAATTAAAG TGCAGCAGCT ACAACAAGAC CCCTTGAGTC
CACCAGCATT CCGCCTAAAA AAGGTTTGAA GCTTGTGGAG TGTTTCTGTT TCACATCTGA
GATTTTCGAT AGGCTCCTAG TGAAGCAAAG CCCACTGCAG TCGCAGTCGT TCAGAGTCCT
CCACGCAAAG TAACCGAAGA AGATAAAGAG GCATGGCGGA TTCCTCCGTC AATATCGAAT
TGGAAAGTAC GTCGAACTTT GTGTGAAGTT TATACCTTCT CAATATTCTA CTTAGAATTC
GAAGGGCTAC ACTATACCCT TAGATAAGCG CCTTGCAGCA GATGGCCGAG GGTTAGTCGA
TGTGAAAATT AATGATAACT TTGCAAAATT TTCTGAGGTA TCTTTACTAT TGCATCGTCA
CTTCGGACGT GTTTGAATTG ATTCATAGGC ACTGTATGTT GCCGAACAAT CTGCACGCGT
TTCAGTTGAG ACACGTGCAA ATGTGCAAAA ACAAGTGGCA CTGCAGGAAC AGATTTCGAG
GGAACAAACT ATTCGGCAAG TAGCGGCTCA GGCCCTTCGT GATCAAAATG TAGAGAGCGA
AGGTCTAGAG GAAAAGGTGA GTGGCATGCG AACACTACAA CCTTTTATTT GACATCAAAC
TACAGTTAAG TCACAAGCGA ATGAAAAAAT GA
 
Protein sequence
MAVRMLQYGD ERASRVDDVG GDGGAFPERR HIETGGSPAS LAQDSTRISP HGAVSTSIID 
TKEISVVQSD LRLARSQIIT PETRAKQQKQ LQDAIDGKLY PQKHLTDLKG GNTHAQYIKY
APVGARERVI KVQQLQQDPL SPPAFRLKKA PSEAKPTAVA VVQSPPRKVT EEDKEAWRIP
PSISNWKNSK GYTIPLDKRL AADGRGLVDV KINDNFAKFS EALYVAEQSA RVSVETRANV
QKQVALQEQI SREQTIRQVA AQALRDQNVE SEGLEEKLSH KRMKK