Gene OSTLU_16547 details


Gene Information

Locus tag: OSTLU_16547
Symbol:
ID: 5003198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 558876
End bp: 559985
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table:
GC content: 70%
IMG OID: 640418619
Product: predicted protein
Protein accession: XP_001419200
Protein GI: 145349564
COG category:
COG ID:
TIGRFAM ID:
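
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 558876
END_BP = 559985
GENE_LENGTH_BP = 1110
PROTEIN_LENGTH_AA = 369


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(START_BP, END_BP))            # 1110
print(expected_protein_length(GENE_LENGTH_BP))  # 369
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.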


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.589126
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.444988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAGG CGAAGGGCGC GACGGCGACG GCGCGAGGAT CGGCGACGAG GGCGGCGTCG 
GCGCGCGAGA CGGCGGTGCG AACGCCGATG ATTCGCGAGG CGCGGCGGCG GGCGACGACG
AGGACGAGGA CGCGCGAGGG GGGGGAGGGC GGGCGCGGGG CGCGCGAGGA CGGTTTGAAA
TTCAAAGTCG GCGGCGACGA CGAGACGCGG TCGACGAGAG GGGCGAGCGC GAGCGCGTCG
GGGGGGGCGC GCGAGGCGGT GGTGGTTGGA ATGCGATTAG ACGCGCTGTT TGACGCCGCC
GCGGATCGGG AGACGAGGGA TTCGGAAGCG GCGACGAGGG CGGACGATTT GACGGCGTTC
GCGGCGGCGA TGACGCCGCG GGCGCGCGAG AGAGCGCCGC GGCGCGCGCT GGCGGTGTAC
GAGAAGACAT TCGTGTGTGG GATAGGAGGG TGTGAGAAGA GTTACGGGAG CGCTTCGAGT
TTGTGCGCGC ATAAACGAGC GAGACATCCG GGTTGGCGCG AAGCGGCGAC CGCGACGAAG
GCGACGACGA TCAAGAAGGA AGACGACGAA GACGTGATCG ACGGCGCCGG AGACGCCGGA
GACGCGGTCG ATGATGATGG TAACGTCTTC GACGACGCGT CGATGCGCCT GGCGCGCAAA
CGCGGGGCGA GCGGTTTGAA CGACGCGAAC CGCGCGTCGG CGCTCGGCGC CTACTTTGAA
ATCGTCGCCG CGGATGCGCA CGGACGTTTG AGTTCGGCGA ATCGAACCAA ACGGCGTCTC
GTTCGTCTTT CTAAAGACGC GGGCGCGTGC GCGAACGACC CGAGCGAGCC CGCGGAACGG
CGCGCCGCCG CCGCCGCCGC CGCGCGCGTC TTCGCGACGG TGGAAGAGTC CGTCGACGGC
GAGCAAGAGC GCGCCAGCGC TTGGTTACAC ACCCTCGACG ACGCCTCGAG GGCGGTTTCG
AAAACTCAAA CCTCGTCGTC GATCATCGAC GCACGAGCGC TCGACCCGGA CGCGCGTTCG
CACGCGCCGG AGCTCGTCCG CGCGTGCGTT CGGCACCGCA TCGCTCGCGT CTTACGCCGA
GCCTTCGTCC AAGCCGCGTC GCCGGCGTGA
 
Protein sequence
MDEAKGATAT ARGSATRAAS ARETAVRTPM IREARRRATT RTRTREGGEG GRGAREDGLK 
FKVGGDDETR STRGASASAS GGAREAVVVG MRLDALFDAA ADRETRDSEA ATRADDLTAF
AAAMTPRARE RAPRRALAVY EKTFVCGIGG CEKSYGSASS LCAHKRARHP GWREAATATK
ATTIKKEDDE DVIDGAGDAG DAVDDDGNVF DDASMRLARK RGASGLNDAN RASALGAYFE
IVAADAHGRL SSANRTKRRL VRLSKDAGAC ANDPSEPAER RAAAAAAARV FATVEESVDG
EQERASAWLH TLDDASRAVS KTQTSSSIID ARALDPDARS HAPELVRACV RHRIARVLRR
AFVQAASPA
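
The gene and protein sequences can be compared by translating the DNA. The following is a minimal Python sketch, assuming the standard genetic code (the Translation table field in this record is empty, so that choice is an assumption). The function names are illustrative only, and just the first and last 30 bases of the gene sequence are used.

```python
# Standard genetic code, codons ordered T, C, A, G at each position (assumed table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_30 = "ATGGACGAGG CGAAGGGCGC GACGGCGACG"
last_30 = "GCCTTCGTCC AAGCCGCGTC GCCGGCGTGA"

print(translate(first_30))  # MDEAKGATAT
print(translate(last_30))   # AFVQAASPA*
```

Both fragments match the start and end of the protein sequence listed above, with the final TGA codon read as a stop. Passing the full gene sequence to gc_content would give a value to compare with the GC content field.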