Gene OSTLU_24045 details


Gene Information

Locus tag: OSTLU_24045
Symbol: lhca4
ID: 5000142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 280656
End bp: 281836
Gene Length: 1181 bp
Protein Length: 203 aa
Translation table:
GC content: 69%
IMG OID: 640415563
Product: Photosystem I light harvesting complex, chlorophyll a/b binding
Protein accession: XP_001416356
Protein GI: 145343493
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0272669
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTCTCTTCCA CCGCCGTCGC CAGGTGCGCG CTCGACGCGA CGTCGCGCGC TCGGAGACGC 
GCCGCGAACG GATCGCGCGC GTCGTATCGC GATCGTATCG TATCGCGCGA GGGAGATACG
GCGACGGGCT CGCGCGCGCG CGGACGCGGC GCGCGCGCGC GGACGCGCGT CTTCGAACGG
GGATTTCGAT TCGAGGGAAT TTCGCGCGTC GCGCGCGGCG ATCGAGCGGC GCGGAAGCGC
GCGCGAGGGC GCGGGGGGGG CGGACGTTCG AGCGGCGATC GGGCGCGGCG GGACGGGGCG
GGACGCGCGC GGCGACGCGC GCGGGGCTTC GGACTCGACG CCGTCCGCGG CGCCGCCGCG
GGCGCGCGAG CGACGCGAGG GAGCGACGGC GCGCGGCGAG GCGGCGCGGC GAGACGCGCG
GCGCGAGGGA GGGATGGGAT GGACGGGACG CGAACGCGAC GAACGGGCGA CTGACGACGA
ACGCGCGATT TCGCGAATGC AGTTTGCGCG CGGCGAAGGC GCAAAAGCGT CAAAGCTTGA
AGACGCGCGC GAGCGCCGCG ACGCAACCGA TGTGGATGCC GGGTGCGACC GCGCCGGCGC
ACTTGAAGAA CGAGTTGCCG GGTGATTACG GCTTCGATCC GCTCGAACTC GGCAAGGATC
CGGAGGATTT GAAGTGGTAC GTGCAAGCCG AGCTTCAACA CGGCCGCTGG GCGATGCTCG
GCGTCGCCGG CGCCGCGGCG CCGGAGATCC TCACCAACAT GGGCATCAGC AACTTGCCGA
ACTGGCACGA GGCCCCGTCG TACGACGGCT ACTTCACCGA CGCCACGACT TTGTTCTGGG
TGCAAATGCT CATGATGAAC TGGGCCGAAG TGCGCAGATG GCAAGACATT CGCAAGCCGG
GTTCGGTGAG CGAAGATCCG ACGCCGTTCT CCAACGCCAA GCTCCCGGCT GGCGTTGTGG
GTTACCCGGG TGGCATCTTC GACCCGCTCG GCTACGCCAA GGGTGACTTG AAGACCTTGA
AGGCGAAGGA GATCGCCAAC GGTCGTCTCG CGATGGTCGC CTTCGCCGGC ATCATGGTGC
AATACGACCA CACCGGCGTC GGCCCGGTGG CCAACTTGGT GTCCCACATG GCGGATCCCG
CGCACAACAA CGTGTTCGCC GCCAAGTTCA TCGGCTTCTA A
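
The length and GC content fields of this record can be rechecked from the coordinates and the gene sequence listing. The following is a minimal sketch, not the database's own method: the helper names (clean_sequence, span_length, gc_percent) are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks of a grouped listing and upper-case it."""
    return "".join(text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, pasted with its original grouping.
first_line = "TTCTCTTCCA CCGCCGTCGC CAGGTGCGCG CTCGACGCGA CGTCGCGCGC TCGGAGACGC"
seq = clean_sequence(first_line)

print(len(seq))                      # 60 bases on a full line of the listing
print(span_length(280656, 281836))   # 1181, matching the Gene Length field
print(round(gc_percent(seq)))        # GC percentage of this one line only
```

Passing the whole listing through clean_sequence and then gc_percent gives a figure to compare with the GC content field.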
 
Protein sequence
MWMPGATAPA HLKNELPGDY GFDPLELGKD PEDLKWYVQA ELQHGRWAML GVAGAAAPEI 
LTNMGISNLP NWHEAPSYDG YFTDATTLFW VQMLMMNWAE VRRWQDIRKP GSVSEDPTPF
SNAKLPAGVV GYPGGIFDPL GYAKGDLKTL KAKEIANGRL AMVAFAGIMV QYDHTGVGPV
ANLVSHMADP AHNNVFAAKF IGF
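
The protein sequence can be related back to the gene sequence by translating the stretch that begins at ATGTGGATGCC and runs to the final TAA. The sketch below assumes the standard genetic code (NCBI table 1), since the Translation table field of this record is empty; the translate helper is hypothetical and handles only unambiguous A, C, G, T codons.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the coding stretch, copied from the gene sequence listing.
print(translate("ATGTGGATGCC GGGTGCGACC GCGCCGGCGC ACTTGAAGAA C"))  # MWMPGATAPAHLKN
# End of the gene sequence, in the same reading frame.
print(translate("GCCAAGTTCA TCGGCTTCTA A"))                         # AKFIGF
```

Both fragments agree with the first and last residues of the protein sequence listed above.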