Gene OSTLU_31850 details


Gene Information

Locus tag: OSTLU_31850
Symbol: lhca5
ID: 5001723
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 639957
End bp: 641043
Gene Length: 1087 bp
Protein Length: 185 aa
Translation table:
GC content: 63%
IMG OID: 640417144
Product: chlorophyll a/b binding, possibly photosystem I light harvesting complex
Protein accession: XP_001417828
Protein GI: 145346713
COG category:
COG ID:
TIGRFAM ID:
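
The listed gene length is consistent with the start and end coordinates if both are read as 1-based and inclusive (an assumption; the record does not state its coordinate convention). A minimal Python sketch of that check follows; the helper names `span_length` and `ungroup` are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def ungroup(sequence_block: str) -> str:
    """Strip the spaces and line breaks from a sequence printed in 10-character groups."""
    return "".join(sequence_block.split())


# Coordinates taken from the record above.
print(span_length(639957, 641043))  # 1087

# Illustrative fragment only: the first 25 residues of the protein sequence.
print(len(ungroup("MWLPAPYKAP AHLDGTVAGD\nYGFDP")))  # 25
```

Because the gene is marked as spliced, the 1087 bp span includes intron sequence, so it is not expected to equal three times the protein length.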


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.719224
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.42101
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATCGCACGTC ACGCACGCAC ACCCAGCGCA CGACACGCAC CGTCTCGTTC GTCGCACGCA 
TTTAACACAG CCATGATCGC ACAACGCGCG TCCACGACGC GAACGATCGC GACGCCGACG
CGCGCGACCG AAGCGCGACG CGCGCGCGTC GTCGCGAACG CGGCGGACCG AAAGATGTGG
TGCGTACGAA CGGAAGCGCG GATGGGGGAA TCGGAAGGAA GAATTACGTC GCGGGCGAAC
GCGCGAGCGT CGCGCGCGCG CGACGCGTCG CGCGACGCGT CGATGCGCGC GCGATGGCAT
CGGACGTATC CGAGATACGG GATATCTGAT ATGGCGCGCC GCGCGCGATC GGGCGACGAC
GGTTGGGAGG GCGAATCGGT CGCGCGACGC GCGAGTGACG CGCGGGTGGC GATGCGAATG
ATGATACTGA CGGAATGATT CGATGCTTAC GCGCGATGTA GGTTGCCCGC ACCGTACAAG
GCGCCGGCGC ACCTCGACGG CACCGTCGCC GGGGACTACG GCTTCGATCC TCTCGGCTTG
GGCAGCGACC CGACGCGCCT CAAGTATTAC CAAGAGGCGG AGCTCATGAA CGCTCGATGG
GCGATGATGG CTGTTGCGGG TATTGTCGGT ACCGAAATCG CGGGCATCGA ACCGCGATGG
TGGGAAGCCG GCACCGAGGA TTACGGATTC CCGCCGCAAG CGCTCCTCGC GGTGCAGCTT
CCGGTGATGG GGTACCTCGA GAACAAGCGC ATTCAAGGTT GGTTGGCCAC CGGTTCGAGC
GGTGTGAACG AAACCTTCCC GTTCGACCCG ATGGGCATGG GCTCTAAGGA CGAGAAGATG
AAGCTCAAGG AGATCAAGAA CGGCCGCGCC GCCATGATCG CCTTCGTCGG CATCGTCGTG
CAAGGCATCG TCTACCGCGA GGGCCCGGTC GCCGCGCTCA AGGATCACGT CGCCAACCCG
TTCGGTTGCA ACATGGCGAC GAACATCATG AACATCCCGG TGAACTTGGC GTAAACTGAG
CTACGACGTT TAACGGACGT ACGATTACGC CCGATGTGTA CACCGATCAG TCAACAAACA
ACCACCC
 
Protein sequence
MWLPAPYKAP AHLDGTVAGD YGFDPLGLGS DPTRLKYYQE AELMNARWAM MAVAGIVGTE 
IAGIEPRWWE AGTEDYGFPP QALLAVQLPV MGYLENKRIQ GWLATGSSGV NETFPFDPMG
MGSKDEKMKL KEIKNGRAAM IAFVGIVVQG IVYREGPVAA LKDHVANPFG CNMATNIMNI
PVNLA