Gene OSTLU_30781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30781 
Symbollhcp2.4 
ID5000601 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp737460 
End bp738789 
Gene Length1330 bp 
Protein Length233 aa 
Translation table 
GC content62% 
IMG OID640416022 
Productprasinophyte light harvesting complex, chlorophyll binding 
Protein accessionXP_001416747 
Protein GI145344454 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.260173 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGGCATACCT GCACACGCAC ATTTCTACAA TGTCTGCCCT TCTCGCTTCC TCCTTCGTCG 
GCCGCGTCGC CGCCTTCAAG GCGACCAAGA TCCAAGTGCG TATTTTTTAG CCTTCGCGCG
ACGCGCGACG ACGCGCGGGA CGACGACGCG CGCGAGGGAC GCGATCGCAT CGCGCGCCAT
CGATCGACGG CGTGAGATAC GCGACATCGC GCGCGCGGCG CGCATCGCGT CGTCGCGACG
CGGGACGTGC GACGCCGCGA AGGGCGTCGC GGGAAGCGAT GTCGATGATT ATTTGTGATC
GCGACGCGAC GCGCGTCGGC GTCGCGACGC GTCGCGACGG GCGAGCGGGC GCTCGATCGC
CGCGGCGATA GGATAACCGC GATACGGATA TTTTCTATCT GTTTGTTTGA CTGGACGAAG
CGCGGGCGCG GGGCGGAGGA CGCGGGTGAT TGGATCGATC GTTCGAGCGA AGAGAGGAGA
CTGACGAAAG AAACGCGATG TGTTTTACGA AATGAACAGG CCAAGTCTGT CTCCACGACG
GTCAAGGCTG ACATCTACCC GGAATTCGGT ACCTACCCGG GCGGTGGCGA ATCCCCGATC
ATCCCGTTCG GCGACGAAAA GAACGCCGAG CGTGAAGTGA TCCACGGCCG CTGGGCGATG
CTTGGCGTCA CCGGTGCGTG GGCCGCCGAA AACGGCACCG GCATCCCGTG GTTCACCGCG
GGTACCTTGT GCACCCCGGA TGACTGCACC GCCGTCGCGG ACAAGTTCCC GGGCGCCGTC
GCCCCGCTCG CGCCGGAAGG CTCTGGCTAC CCGTCCTTCT GGAACGTTCT CATCATCGAG
ATCGTTCTCG TCGGCGCCGC GGAAGCGTAC CGTACCGGTA TCTCCGACTC TCCGTTCGAT
GATGGCCTCA CCGTCGGTGA CGTCAACCCG GGTGGACGCT TCGACCCGCT CGGCCTCGCC
GAGTCTGGCG ACCTTGAAGA ACTCAAGATC AAGGAGCTCA AGCACTGCCG CTTGTCCATG
TTCGCGTGGT TGGGCTGCAT CTTCCAAGCG CTCGCCACCC AAGAAGGCCC GATCGCCAAC
TGGCAATCCC ACGTTGCGGA CCCGGTTCAC TCCAACGTCC TCACCAACGC GGCCAAGGGC
TTCGGCTTCT ACTAAGCGGT TCACCGCCTT GGTAGCTTCG TCATAGGGTA GCTTGATCGG
CGGCCGTCGA CTTCGCGTCT ACGGTCACCT CCAAGATTTC TCTGACAGCG CTGGGAATTC
CCGACTGCTT TTGGGGCTTT GTCTCTTCAA TAACATTCGT TTTAATGATG CATCTCTCGA
TGTTTGATTA
 
Protein sequence
MSALLASSFV GRVAAFKATK IQAKSVSTTV KADIYPEFGT YPGGGESPII PFGDEKNAER 
EVIHGRWAML GVTGAWAAEN GTGIPWFTAG TLCTPDDCTA VADKFPGAVA PLAPEGSGYP
SFWNVLIIEI VLVGAAEAYR TGISDSPFDD GLTVGDVNPG GRFDPLGLAE SGDLEELKIK
ELKHCRLSMF AWLGCIFQAL ATQEGPIANW QSHVADPVHS NVLTNAAKGF GFY