Gene OSTLU_18884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18884 
Symbol 
ID5006448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009373 
Strand
Start bp95513 
End bp96793 
Gene Length1281 bp 
Protein Length426 aa 
Translation table 
GC content61% 
IMG OID640421869 
Productpredicted protein 
Protein accessionXP_001422433 
Protein GI145356427 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value0.205808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.000000697348 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGAGCG ACGCCGTGAC GTGGCACGTG TACGTGCATA AACTTCGCGC CTGGGTGCCG 
ACGGAAGACA CGCGCGAGAA GGCGCCCGAG GAGCAAGAAC CGACGCGTCC AACGCTGATT
TTGGTCATCC GCACGCAGAC TGGAGAATTT CTGTCGCACG CGCACGGCGA CGACGAGCGC
GGGAACGTGG TGTACGACGA ACCGACGTGC GAAGAGGTTG TCGAATTTCT GTCTCGCGTC
GCGGAAAATC CAAAGGTGCT GAATTCGAGC AAAAAGGGCG ACGGCGAGGG GACGACGGCG
AAGCCGACGC GCGTGGCGTT CGCGTGCACG GCGACGGCGC GCGCGCTGGC GGGGAAATCG
CTCGAGGAGT GGGATTCGTT CGATGGGTGT AAGTATGTCA ATGGATGCCG AGATGGGGTG
ATGGCGAAAA TACCGAGCGT GCGAGATGTG AGTTTCGCGC CGGTGCCGGA GAGCGTGTTG
CGAGACGTGG TGCGAGGGCA AATCGAGCCG CAGATGCGGC CGAGCGAGAA TGACGAGGTC
TGGGGGACGC AGCATTTGCC GGGATTGATG GAATCCATCG ATGGGTTCAC GCCTCGGTTC
GGTGCGAGTT TATTCGGCGC GGCGAAGGCG TTCGCTGAGT GCGGGATTCG GCGTAAGCTT
GAACGGCGTC GACCGGTGAA AATCGCGTAC CGGTTAATGC TGCGCGACGA CGTCACGATG
CGGTTGACGT GCTTCGTCGC GTTCGATGGG ACGTTCGAGG ATGAAAACTT CGGATTCAAC
GTGCACAAGA CGCTGAAGGA TGCGCAGGTG GCGTTTGAGG TCGAGCACGG GAACGAGGAC
GTCGAGCCGA GCTTGGAGGG GCAGACGTGC ATGTTTTCGA GCGCTGTGGA GACGCCTTTC
GAGGATTTAG ACGCGCGCGA CGCGCACGAT TGGCCGTTAG TCGCGGCGGA GGGTGAAGTC
GGCGGCGTGT TGTGGCCGCT ATTCTTTAAA ATCGCCCTAG ACGAGGAATC CGGCGACAAT
CTCGAGATTT CCCGACCGGC GATCATCGAG CTTCAGGCGT TCGAACTCGC GATGAAAGCC
GTCGCGCAAT TGGCGCGACG CGACGAAGGA TTCGAATGCG CGCCCGCGGG CGAAAACGGT
GACTTGACCG AGGTTCGCAA GCCCCGCGGA CCGTGGACGA TCAAGACGTC GTCGTTCGCG
GCCAAGGGCG AGGAGGAAGA AATCGAGATC GAAGTGTCCT TACCCGAGCT CGCGGAGATG
AAGGCGAACG AGTACTTGTG A
 
Protein sequence
MASDAVTWHV YVHKLRAWVP TEDTREKAPE EQEPTRPTLI LVIRTQTGEF LSHAHGDDER 
GNVVYDEPTC EEVVEFLSRV AENPKVLNSS KKGDGEGTTA KPTRVAFACT ATARALAGKS
LEEWDSFDGC KYVNGCRDGV MAKIPSVRDV SFAPVPESVL RDVVRGQIEP QMRPSENDEV
WGTQHLPGLM ESIDGFTPRF GASLFGAAKA FAECGIRRKL ERRRPVKIAY RLMLRDDVTM
RLTCFVAFDG TFEDENFGFN VHKTLKDAQV AFEVEHGNED VEPSLEGQTC MFSSAVETPF
EDLDARDAHD WPLVAAEGEV GGVLWPLFFK IALDEESGDN LEISRPAIIE LQAFELAMKA
VAQLARRDEG FECAPAGENG DLTEVRKPRG PWTIKTSSFA AKGEEEEIEI EVSLPELAEM
KANEYL