Gene OSTLU_33699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33699 
Symbollhca2 
ID5003874 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp658246 
End bp659280 
Gene Length1035 bp 
Protein Length235 aa 
Translation table 
GC content66% 
IMG OID640419295 
Productchlorophyll a/b binding, possibly photosystem I light harvesting complex 
Protein accessionXP_001419664 
Protein GI145350546 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTTT CCATGCGCAT TTCCTCCACG GCCGGGTGCG TGTTCGCGCT CGTTCGTTCC 
GCGTCCGCGT CCGCGTCCGC GCTCGCGCGT CGCGCGTTCG CGCCCGTCGC GCGCGGTCGC
GCCAGACGGA TCGTATCGAG CGTATCTTCG ATGCGCTCGA GCGTGCGATC GGGGGCGCGC
GCGCGTTCGC CCGGCGCGCG CGCGCGTACG CGCTCGGGTC GCGCGTCGAT GACGATGCGA
ACGAATCGAA CGCGTCGCAT CGCGCTCGAC GCGCCGCGTC GTTCGCGGCG TGGACGCCCG
ATTCCGTCGA CCCTTCGATC GCCGACTGAC TGATATCCCT CCTCGCTCTT CGCGCTCGCC
AGGCTCAAGA CCCGCGTCGC GACGAAGACT CGCGCGTCCC GCGGCCCGGT GGCGGTGTCC
GCGAACGCCG ACCGTCCGGT GTGGTACCCG GGTAAGGCGC CGGCGGCGCA CCTCGATGGG
TCCTTGCCGG GCGACTACGG CTTCGACCCG CTCTCGCTCT CGGCGGATCC GGAGATGCGC
CAGTGGATGG TGCAAGCCGA ACTTCAACAC GCGCGCTGGG CGATGTTGGG CGTCGCCGGT
TGCGTTGCCC CGGAGCTTTT GACCAAGATC GGTGTCGCGG ACTTGCCGAA CTGGGTCGAC
GCCGCCACTT ACCAATACTG GGCCCCGGGC GGGACGCTGT TCTTCATTCA AATGGCCATG
TTCAACTGGG CCGAAATTCG CCGCTGGCAA GACATGAAGA ACCCGGGCTC GGTCAACACG
GATCCGTTGT TCGGCTACAA CTCCAACGAC ACCAACACGG ACGTTGGCTA CCCGAAGGGC
TTGTTCGACA AGTTCGGCTG GGCTAAGGAT GAGAAGACCA CGGCGGAGCT CAAGTTGAAG
GAGATCAAGA ACGGGCGCTT GGCGATGCTC GCCTTCCTCG GCATCTGCGC GCAATACGTC
CAAACGGGCG TCGGTCCCGT CGAGAACTTG TTCAGCCACA TGGGTAACCC GGGTCAAGTT
GGTGTTTTCA TGTAG
 
Protein sequence
MSVSMRISST AGLKTRVATK TRASRGPVAV SANADRPVWY PGKAPAAHLD GSLPGDYGFD 
PLSLSADPEM RQWMVQAELQ HARWAMLGVA GCVAPELLTK IGVADLPNWV DAATYQYWAP
GGTLFFIQMA MFNWAEIRRW QDMKNPGSVN TDPLFGYNSN DTNTDVGYPK GLFDKFGWAK
DEKTTAELKL KEIKNGRLAM LAFLGICAQY VQTGVGPVEN LFSHMGNPGQ VGVFM