Gene OSTLU_30780 details


Gene Information

Locus tag: OSTLU_30780
Symbol: lhcp2.2
ID: 5000804
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 735869
End bp: 737158
Gene Length: 1290 bp
Protein Length: 233 aa
Translation table:
GC content: 65%
IMG OID: 640416225
Product: prasinophyte light harvesting complex, chlorophyll binding
Protein accession: XP_001417039
Protein GI: 145345055
COG category:
COG ID:
TIGRFAM ID:
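
The gene length in this record can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the stop codon is counted with the coding bases; the function and variable names are illustrative only.

```python
# Values copied from the record above.
START_BP = 735869
END_BP = 737158
PROTEIN_LENGTH_AA = 233


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


gene_length = span_length(START_BP, END_BP)

# Assumption: three bases per residue plus one stop codon.
coding_bases = PROTEIN_LENGTH_AA * 3 + 3

# Bases in the gene span that do not code for protein.
non_coding_bases = gene_length - coding_bases

print(gene_length)       # 1290
print(coding_bases)      # 702
print(non_coding_bases)  # 588
```

Under these assumptions the span matches the listed gene length of 1290 bp, and the bases left over after the coding portion are consistent with the gene being marked as spliced.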


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.71449
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.246837
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTCGTAATGT CTGCCCTTCT CGCTTCCTCC TTCGTCGGCC GCGTCGCCGC CTTCAAGGCG 
ACCAAGATCC AAGTGCGTAT TTTTTAGCCT TCGCGCGACG CGCGACGACG CGCGGGACGA
CGACGCGCGC GAGGGACGCG ACGCGCGCGC GTCGAACGAA CGAACGAACG ATCGGGCGAT
CGATCGCGGT GGGCGATCGC GCGCGGCGCC CTCGCGCGGG CGGTGTCGCG CGGACGGGCG
CGGGACGGCG CGCGGTGGAT GCTGGAAATG TGTCGCGCGC GTCGCGCGAC GTCGGGGGTC
GAGGCGTCGG TCGCGGGCGA CGCGCGCGCG ACGCGGGCGA TCGGGCGGGA CGCCGCGCGC
GAGATACGGT TTCGCTCTTA TCGCCGCGGA TGGGGGTCGG CACGAGTCGT GAACCGTGAA
AACGTGGATC GGGGGGGGAA CAGACTTTTT AGAGCGGCGG GCGCGTCGGC GGACGACGAC
GGATGGGCGT GCGCGATCAC GAGACGACGA GACTGACTGG AGAATATAAA CTTCGTTCGA
TCGCAGGCCA AGTCTGTCTC CACGACGGTC AAGGCTGACA TCTACCCGGA ATTCGGTACC
TACCCGGGCG GTGGCGAATC CCCGATCATC CCGTTCGGCG ACGAAAAGAA CGCCGAGCGT
GAAGTGATCC ACGGCCGCTG GGCGATGCTT GGCGTCACCG GTGCGTGGGC CGCCGAAAAC
GGCACCGGCA TCCCGTGGTT CACCGCGGGT ACCTTGTGCA CCCCGGATGA CTGCACCGCC
GTCGCGGACA AGTTCCCGGG CGCCGTCGCC CCGCTCGCGC CGGAAGGCTC TGGCTACCCG
TCCTTCTGGA ACGTTCTCAT CATCGAGATC GTTCTCGTCG GCGCCGCGGA AGCGTACCGT
ACCGGTATCT CCGACTCTCC GTTCGATGAT GGCCTCACCG TCGGTGACGT CAACCCGGGT
GGACGCTTCG ACCCGCTCGG CCTCGCCGAG TCTGGCGACC TTGAAGAACT CAAGATCAAG
GAGCTCAAGC ACTGCCGCTT GTCCATGTTC GCGTGGTTGG GCTGCATCTT CCAAGCGCTC
GCCACCCAAG AAGGCCCGAT CGCCAACTGG CAATCCCACG TTGCGGACCC GGTTCACTCC
AACGTCCTCA CCAACGCGGC CAAGGGCTTC GGCTTCTACT AAGCGGTTCA CCGTGCGTGA
CGTCGCTCTC GCGTCGTCTC TCTACCAACT CGTCTCTCTA CCAACTTAGA AGATTGCATT
TTAATAAAAA CGATTTTCGA ATCAACAAAA
 
Protein sequence
MSALLASSFV GRVAAFKATK IQAKSVSTTV KADIYPEFGT YPGGGESPII PFGDEKNAER 
EVIHGRWAML GVTGAWAAEN GTGIPWFTAG TLCTPDDCTA VADKFPGAVA PLAPEGSGYP
SFWNVLIIEI VLVGAAEAYR TGISDSPFDD GLTVGDVNPG GRFDPLGLAE SGDLEELKIK
ELKHCRLSMF AWLGCIFQAL ATQEGPIANW QSHVADPVHS NVLTNAAKGF GFY
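
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks might be joined and checked against the listed lengths, and how a GC percentage could be computed, is given below. The helper names are hypothetical, and the record does not say how its own GC content figure was calculated.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Protein sequence copied from the record above.
PROTEIN = """
MSALLASSFV GRVAAFKATK IQAKSVSTTV KADIYPEFGT YPGGGESPII PFGDEKNAER
EVIHGRWAML GVTGAWAAEN GTGIPWFTAG TLCTPDDCTA VADKFPGAVA PLAPEGSGYP
SFWNVLIIEI VLVGAAEAYR TGISDSPFDD GLTVGDVNPG GRFDPLGLAE SGDLEELKIK
ELKHCRLSMF AWLGCIFQAL ATQEGPIANW QSHVADPVHS NVLTNAAKGF GFY
"""

# First line of the gene sequence, used only as a short demonstration.
FIRST_LINE = "GTCGTAATGT CTGCCCTTCT CGCTTCCTCC TTCGTCGGCC GCGTCGCCGC CTTCAAGGCG"

print(len(clean_sequence(PROTEIN)))     # 233, matching the listed protein length
print(len(clean_sequence(FIRST_LINE)))  # 60
print(round(gc_percent(clean_sequence(FIRST_LINE)), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` in the same way would give a figure to compare with the listed GC content.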