Gene OSTLU_32072 details


Gene Information

Locus tag: OSTLU_32072
Symbol:
ID: 5002168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 224618
End bp: 225786
Gene Length: 1169 bp
Protein Length: 381 aa
Translation table:
GC content: 55%
IMG OID: 640417589
Product: predicted protein
Protein accession: XP_001418185
Protein GI: 145347465
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
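
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the Start and End positions are 1-based and inclusive (the record does not state the convention). The function names are illustrative and are not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein of this length plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(224618, 225786)  # Start bp and End bp from the record
cds_len = coding_length(381)            # Protein Length from the record
print(gene_len, cds_len, gene_len - cds_len)  # prints: 1169 1146 23
```

Under these assumptions the span agrees with the listed gene length of 1169 bp. The 23 bp difference from the 1146 bp coding length corresponds to the bases that precede the first ATG in the gene sequence.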


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.186816
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.540489
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCGCGCGCG ACCGAGCGCA TCGATGTTCT CGTCGTTCAG CATCACCGGT GCGGCGGCGG 
AGGTGTACGA CGTCGAACTC GCGTACGAAG ACGCGCGTCG CGGGCGACGC GACGGCGAAA
AGGCGTCGAC GACGACGACG ACGACCGAGG TGACGTTCGA CTCCAAGGAG GATCCCGAGG
AGGTGATTGA TAAACTCGTG AAACAGCGAC GAGATCTGCC GTACGAGACG GGATACGAAT
GCGTCGTCAT CCGAGCGCTT TGCTCGGCGT CGAAAGTCGC GATACAGCTC GGGGTGGAGA
AGTATCAACC GAGACCGGTG GCGTTGAAGG TGACTTCGAA TGGAAAGATG AACGCGGGAC
AGTCGCACTC GAGCTTGGCG TCGGTGCTCA AGCACTGTCA GCTCGGCTCG CTGCGGAAGC
TGAGCGTGCA AGATGGATTG TTGTCCACGG TTTTGTCTTT ACGAGATAAA TGGACCTCGG
TTCGAGAGTT GGACATTTCG AACAACGCGT TGGAAACGCT GCCGAAAGAG CTGTTCGCGC
GTTTCCCGTA CCTGGAGGTT CTTCGTTTGG ACGGGAATAA ACTCGCGACT TTGCCCAACT
TGAACGCGTT CACGCTTCTC AAAGAGTTGC ATGCGAACGG CAACGCGTTG TCAACGCTGC
CAATCGACAT GGTGGAAGAT TTGGATTTGG AAGTTTTGTC CGTCGAGTTC AACCGCTTGA
GCAAGCTGCA CGTCAAGTTG AAGGATTTGT CCAAACTGCG CGTGTTGCGG TTACTTGAGA
ATCCCATAGA GACGCTGCCC CGGTTGAATA AGACCGCCAA TCAAGAGTGC TTATCGCTCG
CAAACGTGAA TGTTTCGAGG AATGGAGCAA CAGGCGGTGT CTCCGTACAG GTTCGCGAAA
CGAGCTCGTC TTACTTTTCC AGCATAGTAG GCGGCAAGAC AACGTCCAAG GAAAAAGCGT
ACAACGCTTT CCTGAGCTTG ATCTTTCGTA GCAGTGAATG TCAAAACGCA TTACTCGTCG
CCGCCGTCGC GGTGATCGCG TCGAAGAGCC GAGAAAACTG CGAAGCCATA GTGCTGACCG
AAGGTGCGAG CGTTCGACCG CTTCTTCACT CTGGGGAAAA TTTACATGTG AAACTGACCG
TACGTCCGTC CGCACCACAG GAGCCGTGA
 
Protein sequence
MFSSFSITGA AAEVYDVELA YEDARRGRRD GEKASTTTTT TEVTFDSKED PEEVIDKLVK 
QRRDLPYETG YECVVIRALC SASKVAIQLG VEKYQPRPVA LKVTSNGKMN AGQSHSSLAS
VLKHCQLGSL RKLSVQDGLL STVLSLRDKW TSVRELDISN NALETLPKEL FARFPYLEVL
RLDGNKLATL PNLNAFTLLK ELHANGNALS TLPIDMVEDL DLEVLSVEFN RLSKLHVKLK
DLSKLRVLRL LENPIETLPR LNKTANQECL SLANVNVSRN GATGGVSVQV RETSSSYFSS
IVGGKTTSKE KAYNAFLSLI FRSSECQNAL LVAAVAVIAS KSRENCEAIV LTEGASVRPL
LHSGENLHVK LTVRPSAPQE P
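
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard genetic code, since the Translation table field in this record is blank, and it uses only the first line of the gene sequence. The helper names `clean` and `translate` are hypothetical and chosen for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code, assumed).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "CGCGCGCGCG ACCGAGCGCA TCGATGTTCT CGTCGTTCAG CATCACCGGT GCGGCGGCGG"
dna = clean(first_line)
start = dna.find("ATG")  # 0-based offset of the first ATG
print(start, translate(dna[start:]))  # prints: 23 MFSSFSITGAAA
```

The translated prefix matches the start of the listed protein sequence (MFSSFSITGA AA...), which is consistent with the coding region beginning 23 bases into the gene sequence.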