Gene OSTLU_16898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16898 
Symbol 
ID5003516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp596195 
End bp597394 
Gene Length1200 bp 
Protein Length399 aa 
Translation table 
GC content61% 
IMG OID640418937 
Productpredicted protein 
Protein accessionXP_001419849 
Protein GI145350937 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0312295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.201381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGA AAGAGATCGA GGCGCGCGCG GTGCGATACA TGGGGATCGG GGACGATGGT 
AGTGGATGGA CGGCGGTGGA GGAAGATGGG AGCGGGGCTG GGGGGAGACG ACGAAGACAC
GATTCGGATT CGGAAGAGGC GACGGCGGCG GCGGCGGCGG CGCCGAGTCG CCGCGCGAGA
CACGATAGCG ATAGCGAAGA CGACGCCGGT GATGCTTCGG TACCGCCCGA GGATGCGCCA
ACGACGAGCG CGGGCGATGG GCTGCAGTAC GATAGCGATG GGGACTTGAT TATTCCTCAG
GAGCCCGCGG CGGCGGCGGC GGCGGCGGCG GCGGCGGGTG AGCCGCAATA CGACAGCGAT
GGCGATTTTA TCCTGCCTGA AGAGCCTTCC GGGCAGCAAG AGCTGCAGTA CGATAGCGAC
GGTGACTTGA TCTTACCGCC CGATCCCTTG CCGGAAGCGC CCGCGGAAGA TAATAAAAAG
AAGTCGAAGG AAAAGAAGAC GAAGGAGCAC AAGATGACGG ATGGCACGTC CACCGGTCTC
GTGAGCGCGG CGCAAGTCAT CATGGAAGCT GAGTTGAAGC GCAAAGCTGA GCAAGCGCGC
GTGGCTAAGA TGACGGACGA GCAAAGTGGT CGCGGGGCGG CGACGAACTA CCGGGACAAG
GCGACTGGGA AGCTCATGGA TAGCGAGGAG ATGAAGCGTC GCTCGGAGAA TGTCAAACCA
AAGGAGCGCG AACGACCGGT TTGGGCAACG GGTGTGGAAC AGGCGAGACA AGCGAAGCAG
TACGAGGTAG ATTTAGTCAA GGCAAAAGAC ACTCCGTTCG CGCACGCCGA CATCGATGCC
GATTATGAAG ACAAACAGCG AAGTGCGATG CGTTTCGGCG ACCCGATGGC GCATTTGAGC
CGCAAAAAGC GTCACGCTGA ATCGCTCAAT CTCCCATCCG TCGTCGACGG CTTAGGGTTG
TCGATGGATG ATTTGAAAAA GTCTGGTTTC CGAATCCCGC AAGAGGTTCC ACCGCACAGC
TGGCTTCGTC GCGGCGTCGT CGCGCCGCAC AACCGCTACG GCATCAAACC GGGCCGTCAC
TGGGACGGCG TCGACCGCGG CACGGGCTTC GAGGCAAAAA TGTTCCGAAA GAAGAGCGAA
TTGAAAGAGC GCGCGCAGCT CGAAGACGCG GACGCAGAGG AGCACAACGA ATGGTTTTAG
 
Protein sequence
MATKEIEARA VRYMGIGDDG SGWTAVEEDG SGAGGRRRRH DSDSEEATAA AAAAPSRRAR 
HDSDSEDDAG DASVPPEDAP TTSAGDGLQY DSDGDLIIPQ EPAAAAAAAA AAGEPQYDSD
GDFILPEEPS GQQELQYDSD GDLILPPDPL PEAPAEDNKK KSKEKKTKEH KMTDGTSTGL
VSAAQVIMEA ELKRKAEQAR VAKMTDEQSG RGAATNYRDK ATGKLMDSEE MKRRSENVKP
KERERPVWAT GVEQARQAKQ YEVDLVKAKD TPFAHADIDA DYEDKQRSAM RFGDPMAHLS
RKKRHAESLN LPSVVDGLGL SMDDLKKSGF RIPQEVPPHS WLRRGVVAPH NRYGIKPGRH
WDGVDRGTGF EAKMFRKKSE LKERAQLEDA DAEEHNEWF