Gene OSTLU_30338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30338 
Symbol 
ID5000553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp25966 
End bp27270 
Gene Length1305 bp 
Protein Length429 aa 
Translation table 
GC content57% 
IMG OID640415974 
Productpredicted protein 
Protein accessionXP_001416530 
Protein GI145344005 
COG category[A] RNA processing and modification 
COG ID[COG5623] Pre-mRNA cleavage and polyadenylation factor IA/II complex, subunit CLP1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCCTCGTCAT GGCGTCCGAT GGTGACGATG GCATCGCCGT GCAGACGTTC ACGCTGGAAC 
AAGAACAAGA ACTGCGCGTG GAGACGCCCG CGAGGGGGGA GATAAAGCTC AAGCTCGTCG
ATGGCACGGC GGAAGTCTTC GGGGCGGAGA TCGCCGTCGG GCAGAGCATT ACGTGTGTTT
CGGGACGTAA ACTCGCCGTT TTCACCTATC ACGGCGCGAC GATCGAAGTG AGAGGAGAGG
TAGAGATCGC GTACGTCGCC GGGGAGACGC CGATGGTGAG TTACGCGAAC ACGCACTCGG
TTTTGAATGC GAAACGCGTG GCCGCGGCGA GCGAGAATTC GAGCGAAGCC GAGGGACCGA
GGGTAATGTG TGTCGGACCG ACGGACGTGG GTAAGAGCAC GGTGTGTTCT ATATTATGTA
ACTACGCCAC GCGCGCCGGA CACGCTCCGC TGTACGTGGA TTTAGATTTA GGACAGGGCG
CGGTCACGGT GCCGGGAACG ATTTGCGCCG CGCCGATTGA CGCGCAGATA GACCTCGAAG
AGGGAATACC GCTGGAGATG CCTCTGGTGT ACTTTTACGG CGACTTGACT GTGAATAATC
CCGATTACTA TAAGCACATC GTCTCGAGGT TGGGCACTAT GCTAGACGAG CGAAGCAAGG
CAAACGAAGA GGCGCGCGCG GCTGGATGCG TGGTGAATAC GATGGGTTGG ATCGATGGCG
TCGGCCTGGA GCTCTTGCTT CACGCTCGAG AGGCGCTCAA GATTGATCAC GTCCTTGTCA
TTGGTCAGGA GCGTTTGTTC GGGCAACTGC AGCAAAAACT TAAGGGAACG GACTGCCAAG
TGTTTCGACT GCAAAAGTCT GGCGGCGTCG TTGAACGCAC GCCCGAGTAC CGCCGAGCAT
CTCGCGATCG CATGTTCAAG GAATACTTTT ACGGCGCTAC CGGCGAGCTC GCACCGGCGT
CGCAGACGGC TTATTTCTCG AAAATCAGCA TATATCGCAT CGGGGGTGGT CCGCGAGCGC
CGACGTCCGC GTTGCCAATC GGTCAAGCTC CGTCCACGGA TCCCATGCGA GTCACTCCCG
TTGTGCCTTC CACGTCGCTT TTGCACTCCG TCTTAGCGGT TAGTCACGGG AAAACACAGG
GGGACTTGCT CACTTCGAAC GTTGCTGGTT TCATTTATAT CACCGAGGTG AACATGATGC
AGAAATCGTT CACGTATCTG TCGCCGTGCC CGGGCGAATT GCCGTCAAAC GTCTTGCTCT
CTGGTAACTT GAAATGGTTA GGCGAAGATG TGAAGTAGCG AGTGA
 
Protein sequence
MASDGDDGIA VQTFTLEQEQ ELRVETPARG EIKLKLVDGT AEVFGAEIAV GQSITCVSGR 
KLAVFTYHGA TIEVRGEVEI AYVAGETPMV SYANTHSVLN AKRVAAASEN SSEAEGPRVM
CVGPTDVGKS TVCSILCNYA TRAGHAPLYV DLDLGQGAVT VPGTICAAPI DAQIDLEEGI
PLEMPLVYFY GDLTVNNPDY YKHIVSRLGT MLDERSKANE EARAAGCVVN TMGWIDGVGL
ELLLHAREAL KIDHVLVIGQ ERLFGQLQQK LKGTDCQVFR LQKSGGVVER TPEYRRASRD
RMFKEYFYGA TGELAPASQT AYFSKISIYR IGGGPRAPTS ALPIGQAPST DPMRVTPVVP
STSLLHSVLA VSHGKTQGDL LTSNVAGFIY ITEVNMMQKS FTYLSPCPGE LPSNVLLSGN
LKWLGEDVK