Gene OSTLU_36156 details


Gene Information

Locus tag: OSTLU_36156
Symbol:
ID: 5000252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 335660
End bp: 336940
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table:
GC content: 56%
IMG OID: 640415673
Product: predicted protein
Protein accession: XP_001416134
Protein GI: 145342105
COG category: [R] General function prediction only
COG ID: [COG0661] Predicted unusual protein kinase
TIGRFAM ID:
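
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates, an unspliced CDS (as the record states), and a final codon that is a stop codon and is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record: Start bp 335660, End bp 336940.
length = gene_length_bp(335660, 336940)
print(length, protein_length_aa(length))  # 1281 426
```

Both results agree with the Gene Length and Protein Length fields of the record.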


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.838478
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAAAGC GCGCGTGGGC CACCATGCAC AGCGCGATTG CACTGTGCGG TGCGGCGTTC 
ATCAAGTGGG CGCAGTGGGC GAGCACGCGT GAGGACGTCT TCCCGAAGGA CCTGTGTCAT
CAGCTCGAAC AACTTCATGA CGATGCCCCA CAGCACTCGT ACGGGCAAAC GTTACGTATT
TTGGCGAACG AGTTTGGTTT TGATCCGAGA TTGGTTTTTG CGCAGTTTCC ACGGAAGCCC
ATGGCGTCGG GTTCGGTTGC TCAAGTGTAC AAGGCGCGTT TGCGCAAGGA AGTCGCCGCG
GTGTGCTCGG CGCTCGCTCA ACCTCGCAAG CTCGATTTGA ACGAGGATGG CACCATGGAC
GTCGCGGTGA AAGTGCGGCA TCCCAACGTG GCGAAACGAA TTTTTCTGGA TTTTCAAATC
TTGCGCGCGA TAGCGGGGAT TGCAGACGCG TTACCCGCGC TGAAGGGTAT GCAGCTGAAG
AATACGCTCG GACAGTTCAG TCACACCATG ACGGCGCAGA CAGACTTACG CAACGAGGCC
GATCACTTGT TGAAATTTAC GCACAACATG AAGAGTGAGG TGCAGATACG AGCACCCCGC
CCTGTGCCAG GTTTCGTCAC GGAGGCAGTG CTGGTGGAGA CCTTTGTCAA GGGCGACGGG
TTGGGCAGCG CCATCAAACA TAAAACTTCT CGAAACTCTG AGCTTTGCTC ACTTGGCGTT
CACGCATACT TGCTCATGCT TCTACGCGAC AACTTCATCC ACCAAGATTT ACATCCAGGA
AATATTCTGT ACAGCGTCGA GAACGGCCAG TCGAGTGTCG ACTCAGAAAC AGCGGGGATC
GTGAAACTTG AGTTGATTGA TTTCGGCATC GCAGACGAAC TCCCGAAAGC CGTGAGGAAT
AAGTTTCTCG GATTTTTGTG CTTCTTACTG CGAGGCGAGG GAGAAAAGGC GGCCGACGTC
GCACTGACGT GGGATGCCAA ACAAACGTGC ACCGATCGCG TGGCGCTTCG GCGTGACATG
GCGCGACTCG TTTCGAGCCA AGGAGACGTG TACACGCAGC GCGTCGATTT GGATGGACTG
TTGAAGGAGA TCATGCAGCT TTTCCGCAAG CACGGCGTGA GCATCGACGG CATATATGCC
TCGCTCGTCG TATCTCTCTG CGTTTTGGTG GGATTCGCGA CGAGCCTCGA CCCAGAAATC
AATTTGTTCG AAGTCGCCGC GCCATCCGTC ATGGCGTTCG CGCTCACCGG AGACGTCGTC
GGGCGTTTGT TCAAAGGCTG A
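
The gene sequence is printed in space-separated blocks of 10 bases. As a minimal sketch, the helpers below join such blocks into one string and compute a GC percentage comparable to the GC content field. The function names are hypothetical, and only the first row of the sequence is used as sample input; pasting the full sequence in its place would give the figure for the whole gene.

```python
def parse_sequence_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCGAAAGC GCGCGTGGGC CACCATGCAC AGCGCGATTG CACTGTGCGG TGCGGCGTTC"
seq = parse_sequence_blocks(first_row)
print(len(seq), round(gc_percent(seq), 1))
```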
 
Protein sequence
MRKRAWATMH SAIALCGAAF IKWAQWASTR EDVFPKDLCH QLEQLHDDAP QHSYGQTLRI 
LANEFGFDPR LVFAQFPRKP MASGSVAQVY KARLRKEVAA VCSALAQPRK LDLNEDGTMD
VAVKVRHPNV AKRIFLDFQI LRAIAGIADA LPALKGMQLK NTLGQFSHTM TAQTDLRNEA
DHLLKFTHNM KSEVQIRAPR PVPGFVTEAV LVETFVKGDG LGSAIKHKTS RNSELCSLGV
HAYLLMLLRD NFIHQDLHPG NILYSVENGQ SSVDSETAGI VKLELIDFGI ADELPKAVRN
KFLGFLCFLL RGEGEKAADV ALTWDAKQTC TDRVALRRDM ARLVSSQGDV YTQRVDLDGL
LKEIMQLFRK HGVSIDGIYA SLVVSLCVLV GFATSLDPEI NLFEVAAPSV MAFALTGDVV
GRLFKG
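
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard genetic code (NCBI translation table 1); the record leaves the Translation table field empty, so this choice is an assumption. The `translate` function is a hypothetical helper, not a database or library API.

```python
# Standard genetic code (NCBI translation table 1), in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-TCAG bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and the final 21 bases of the gene sequence in the record.
print(translate("ATGCGAAAGC GCGCGTGGGC CACCATGCAC"))  # MRKRAWATMH
print(translate("GGGCGTTTGT TCAAAGGCTG A"))           # GRLFKG
```

The two outputs match the first and last residues of the protein sequence shown above.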