Gene OSTLU_47897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_47897 
Symbollhca3 
ID5006296 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp232863 
End bp234155 
Gene Length1293 bp 
Protein Length281 aa 
Translation table 
GC content62% 
IMG OID640421717 
Productphotosystem I light harvesting complex, chlorophyll a/b binding 
Protein accessionXP_001422134 
Protein GI145355794 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.266881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.436801 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATTCCCGCAC GCACCCCGAT GTTCGCGCGA ACCTTGAACA CCGTCACCCT CGCGAGCGCC 
CCCGCGCGCG CGACGAAGCG CTGCGCGAAG CGCGCGACCG TGGTCGTGCG CGCCGAGGGC
GAAACCGAGG CCGAGGCCGC GCCGGCGCCG GTCAAGGTGC GACGATTCGA CGACGCGCGC
GCGCGATGCG GGATCGCGTT TCGGACGTTC GGGCGGTGTG ACGACGCGCG CGAACGCGCG
GTGGGACGAA CGACGGCGCG AAACTGTGGA TGGGAAAAGT TTGCGGCGGA TCGACGGCGT
CGCGAGGGCG CGCGCGCGAG GGCGCGCGCG CGAGGGAGGG GCGATGGCGA TCGGGGCGTT
GATACGATCG CGCACCGTAT CTCGCGACGG GGTTCAAAGA TTTGGGTGAT TTGTTGAGTC
GTGACTCGCC AATCGTGGGG GTTGGGCGAG CGCGCGCGAG ACGAGACGAG ACGCGCGGTG
ACTGACGACG AATGCGTGTT TTTGTGTGTT TTACAGAAGG CGGCGCGCAA GACGGATGAC
CTCGCGCCGT TGTACGTGCT CGGGAACTCT GACCAGTCGT TGTCTTACCT CGACGGCTCT
TTGCCGGGCG ACTACGGGTT CGATCCGCTC GGGTTGTCCG ACCCGGAAGG TGCCGGTGGT
TTCATCAACC CGCAATGGTT GGCGTACTCT GAAGTGATCC ACGGTCGATG GGCAATGCTC
GGTGTGGCCG GTATGGTTGC CCCGGAGGTG CTCGGCGGCC TCGGCATCAT CCCGCAAGAA
ACCGGCTTGG TCTGGTACAA GGCGGGTATG ATCCCGGCGC AAGGCACGTA CGATTACTGG
GCCAACCCGT TCACGATCTT CTGGATCAAC GCCGCGTTGA TGAACTTTGC TGAACTTCGC
CGCGCGCAAG ATTACTGGAA CCCGGGCTCC ATGGGCAAGC AAGAACTCAT CGGCTGGGAA
AAGATGCTCG GGGGCTCCGG CGAACCGGCA TACCCGGGTG GTTTCTTCAA CATCATGGGC
CAAGGCAAGT CCGACATGGC GAAGATGCGT GTCAAGGAAA TCAAGAACGG TCGTCTTGCG
ATGATGGCGT GCTTCGCGTG CGGCGCGCAA GCCGTGATGA CGGGTGAAGG CCCGGTGAAG
AACTTGATTG ACCACGTCTC CGATCCGTTC GGCCACAACT TGCTCGTGAA CTTCCAAAAC
ATTGGCGGCG TCTCTCCGTT CTAAGCGGGT TGACATTTAG CTAGTAGCAG CGCTTTGTTT
TGTATGTAAA CACAACAAAA GCATCATCAA AAA
 
Protein sequence
MFARTLNTVT LASAPARATK RCAKRATVVV RAEGETEAEA APAPVKKAAR KTDDLAPLYV 
LGNSDQSLSY LDGSLPGDYG FDPLGLSDPE GAGGFINPQW LAYSEVIHGR WAMLGVAGMV
APEVLGGLGI IPQETGLVWY KAGMIPAQGT YDYWANPFTI FWINAALMNF AELRRAQDYW
NPGSMGKQEL IGWEKMLGGS GEPAYPGGFF NIMGQGKSDM AKMRVKEIKN GRLAMMACFA
CGAQAVMTGE GPVKNLIDHV SDPFGHNLLV NFQNIGGVSP F