Gene OSTLU_31566 details

Gene Information

Locus tag: OSTLU_31566
Symbol:
ID: 5002043
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 146867
End bp: 147797
Gene Length: 931 bp
Protein Length: 275 aa
Translation table:
GC content: 62%
IMG OID: 640417464
Product: predicted protein
Protein accession: XP_001417695
Protein GI: 145346441
COG category: [R] General function prediction only
COG ID: [COG0220] Predicted S-adenosylmethionine-dependent methyltransferase
TIGRFAM ID: [TIGR00091] tRNA (guanine-N(7)-)-methyltransferase
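
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: the function names `span_length` and `coding_length` are hypothetical, and it assumes 1-based inclusive coordinates and that the gene span includes the stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(146867, 147797)   # Start bp / End bp from the record
cds_len = coding_length(275)             # Protein Length from the record
non_coding = gene_len - cds_len          # bases in the span that are not translated

print(gene_len, cds_len, non_coding)     # prints: 931 828 103
```

The 931 bp result matches the reported Gene Length. The 103 bp that are not accounted for by the 275 aa protein plus a stop codon are consistent with the record's "Is gene spliced: Yes" entry.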


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGC CGGCGTCCAC CGTCGCGGGC ACGCGCGTGG TGAAGATCGG GGACGCGTCG 
CTCGAGGTGC GGACGCGCGC GACGCGCGCG CGCGACGGCG CGACGGCGCG AACGCGCCAA
CGCGCGCGAT GGGCGATGGG CGACTGACGG CGCGGTGACG ACGACGCAGG TGCCCTCGTT
CGTGAGCGCG TCGAACGTCG GGACGAGCGC GCCGCGAAAG AAGCATTTTC GTCAGCGCGC
GCACTGTAAT CCGTTGAACG ATGGGTACTA TTACGCGCCC GTGGGACCGT GGGAGATCGA
CTGGCGCGAG CACTACGAGG AATTTTTCGC GAAGCGGGAC GGCGACGGCG ACGCGGGAAG
TCTGAAGATA CGCTTCGCGG ACGTCGGATG TGGGTTCGGA GGGATGCTCG TGCGACTGGC
GGAGGTGTTT CCGGAAAAGT TGATGCTCGG GATGGAGATA CGGGACAAGG TGAGCGAGTA
CGTGCGCGAG CGGTGCGCGG CGTTGCGCAA GGACCACCCG GGTAAGTACT GGAACATTTC
GTGCGTGCGG GCGAACGCGA TGAAAAATTT GCCGCAATAT TTCGAAAAGG GGCAATTGGA
GAAGCTCTTC TTTTTGTTTC CCGATCCACA CTTCAAGGCG GCGAATCATA GGCGAAGAAT
CGTGACGACG ACGTTACTCG CCGAGTACGC GTACGTGCTC GCCGAGGGCG GAATTTTGTA
CACCATCACC GACGTCGAGG AACTCGGTAA ATGGATGTCC GATCACATGT CCGCGCATCC
AATGTTTGAA CGCGTGCCCG AGGCTGAGCT CACGAAAGAT CCCGTGGTGC CGCTCCTTTA
CACGGGCACC GAGGAAGGGC AAAAAGTTGA GCGAAACTCG GGTTCGACGT TTCTCAACGT
CTTCAGGCGC GTCGCCAACC CGAACCATTA G
 
Protein sequence
MATPASTVAG TRVVKIGDAS LEVPSFVSAS NVGTSAPRKK HFRQRAHCNP LNDGYYYAPV 
GPWEIDWREH YEEFFAKRDG DGDAGSLKIR FADVGCGFGG MLVRLAEVFP EKLMLGMEIR
DKVSEYVRER CAALRKDHPG KYWNISCVRA NAMKNLPQYF EKGQLEKLFF LFPDPHFKAA
NHRRRIVTTT LLAEYAYVLA EGGILYTITD VEELGKWMSD HMSAHPMFER VPEAELTKDP
VVPLLYTGTE EGQKVERNSG STFLNVFRRV ANPNH
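
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of how that could be done; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the short input string is only the first line of the gene sequence, used for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence; paste the full block to check the whole gene.
first_line = "ATGGCGACGC CGGCGTCCAC CGTCGCGGGC ACGCGCGTGG TGAAGATCGG GGACGCGTCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the complete gene sequence, the length should agree with the Gene Length field and the percentage can be compared with the reported GC content.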