Gene OSTLU_26466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26466 
Symbol 
ID5004589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp276942 
End bp278549 
Gene Length1608 bp 
Protein Length535 aa 
Translation table 
GC content63% 
IMG OID640420010 
Productpredicted protein 
Protein accessionXP_001420293 
Protein GI145351888 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.377719 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.17293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACG TCGACGCGGA CGTCCTGCCG CTGTTGCTCC GGCTGCGAAG CGCGGGCGCC 
AGCGCGGAGG GGCTGCGACG AGGCGCGGCG GGCGTGCGGG CGTGGAGGGA CGCCCTCGCG
CGAGGGCTGC TGCCGGACGC GTCGCTCGAG TGGCCGGAGG ACGAGACGTT CAGGACGGCG
CTGATCGAGG CGCTGGGGGA TTTAGACATG GCCAGGTTCA CGCGACGGTT TCCGCCGGTG
CTGGACACGT TGATGAAGAA TGTGTTGGAT ATTTTGTACG TGTACGAACG CGATCGAGAG
GACGAAGACG CGACGCCGGA GTTGCCGCCG ACGGAGCCGC GGGATTCGGA GACGGCGAAC
GATGGAGAGG GAGAGGGAGA CGCGCGAGGA AGCGGCGCGG GCGAGAGCGA CGAAGAGGAG
GGGGAGGGCG AGGCGCGGAG TCAGGCGGGA GGGCGAGGCG GCGCGGGGGA AGAAACCGAC
GGGAGCGATA ACGTCGACGA ATTCGACGTG GGGATGGATG GCGACGACGG CGCGAACGAG
GCGATGGAGC GGGCGAAGGA GAAGAATAAG GAGATCGTCT CGCGGCTCAT GGAGGAGTTC
AAAGAGCAGT GGGAACCGGC GATGGATAAG CTCGACAAGG CGGCCAAGGC ATTCGAAGGA
TTAGATTTAG ACGACCTCGC CGACGGCCCG GAAGGCTTCG ACCTCACGCG AGGTCTGTGG
CAGCAGACTG GGTGGAAGGA GCTCGATTCG CTTCGCAAAA AGCTGCAAGA CTTGAAAGAG
TTGCGCGACA TGGTGCGCAG TTTGGGCCGA GGCAGTGGTC GCGGACCTTT GCGTCGCGCG
CCGCGACAAA GAGAGCGCCA AGGATTCCCC ATAGGTCTCG TGCGAAGTCC GATGGAGCCC
GAACAAACAT CCGGTTTGTG CCGCTCGGAC GATTTGTCCC GCATGATGCC GAGCGAGATG
GTGCTACTCG CGTCGAGTCT CCCGCAAGCG CGTCTTTTGC ACTTTGCGCG TCGCGCCGAG
CGCACTTTGC TGTCGTATGA GCGCGTGGGG TGGTCGGAAG AACCCGCGGT GACTGTAGAG
GGCTTCGAGA CGCGCCCCGC GGCGGAGTGC GGGCCAATCA TCGTGTGCCT GGACACCTCG
GGGTCGATGA TGGGCGCTCG CGAGACCGTC GCCAAAGCCA TGGTTCTCGA GTGCATGCGG
CAAAGTCGCT CGCAGCAGCG CGCGTGTTAT TTATATTCTT TTAGTGGCCC AGGAGATTGC
CAAGAGCTCG AGCTCAAGCT CAACGCCGCC GGTCTCTACG GTTTGTTGGA ATTTCTCAGC
GGTAGCTTCC ACGGCGGCAC CGACGTCGAC GAGCCATTCA ATCGCGCGCT CGCTCGGTTA
AACGAGGCCG AATGGAGCAA CGCTGATATA TTGCTCGTCA CCGACGGGGA AATCAAACCT
CCCGACGAAA CCTTGATCGC CAATCTCAAC GAGGCAAAGG AAGAGATGGG ATTAAAGGTG
CACGGTTTGC TCGTCGGCGA CGCCGGCAAC GCCGAAGTCG TGGAATCGAT TTGCACTCAC
GTACACGCGT TCAAGTCCTG GACCGCGGTC GGCGGCAAGC CATCGTAA
 
Protein sequence
MRDVDADVLP LLLRLRSAGA SAEGLRRGAA GVRAWRDALA RGLLPDASLE WPEDETFRTA 
LIEALGDLDM ARFTRRFPPV LDTLMKNVLD ILYVYERDRE DEDATPELPP TEPRDSETAN
DGEGEGDARG SGAGESDEEE GEGEARSQAG GRGGAGEETD GSDNVDEFDV GMDGDDGANE
AMERAKEKNK EIVSRLMEEF KEQWEPAMDK LDKAAKAFEG LDLDDLADGP EGFDLTRGLW
QQTGWKELDS LRKKLQDLKE LRDMVRSLGR GSGRGPLRRA PRQRERQGFP IGLVRSPMEP
EQTSGLCRSD DLSRMMPSEM VLLASSLPQA RLLHFARRAE RTLLSYERVG WSEEPAVTVE
GFETRPAAEC GPIIVCLDTS GSMMGARETV AKAMVLECMR QSRSQQRACY LYSFSGPGDC
QELELKLNAA GLYGLLEFLS GSFHGGTDVD EPFNRALARL NEAEWSNADI LLVTDGEIKP
PDETLIANLN EAKEEMGLKV HGLLVGDAGN AEVVESICTH VHAFKSWTAV GGKPS