Gene OSTLU_2153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_2153 
Symbol 
ID5005358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp228339 
End bp229529 
Gene Length1191 bp 
Protein Length397 aa 
Translation table 
GC content56% 
IMG OID640420779 
Productpredicted protein 
Protein accessionXP_001421425 
Protein GI145354297 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.104794 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGCCGGTGA TCGTGCGACG GACGTTTGAG GCGCTGGGGG CGACGTACGT GAAATTGGGA 
CAGTTCATCG CGAGCGCGCC GAGCGTGTTT CCGAAGGAGT ACGTGGAGGA GTTTCAAAAG
TGCCTGGACG CGACCGAGGT GACGGATTTT TCCATCATTA AGCGGACGAT CGAGAAAGAC
TTGGGACGTT CTATAGATGA TGTGTTCGCG ACGATCGATC CGGTGCCTTT GGCGAGCGCG
AGCGTGGCGC AGGTGCATCG AGCGACGCTG CTCGGAAGCG GGCGAGACGT CGTCGTCAAG
GTGCTGAAAC CGAACGTCGA GGACACGCTC AAGGCGGATT TGAGCTTTGT GTTAATCGTG
AGCAAGGTGT TACAGTTTTT GAATCCTGAA CTCTCGCGAA CATCGTTGGT GGACATCGTC
GGAGACATTC GAGAGTCGAT GTTGGAGGAG ACGGATTTTA GAAAGGAGGC GCAAAACGTC
GACGCTTTTC GACGATACCT TGAAGACGCG GAACTGACGA ACATCGCCAA GGCGCCGCAA
GTGTACAAAC AATTTAGTGG TAAACGAGTG ATGGTGATGG AGTACTTCTC TGGCGTCCCG
CTCACGGACT TGGAGGCGAT TCGTTCGGTG AGCACGCGCG ATCCCGAGGC GACGCTCATC
AACGCGCTCA ACGTTTGGTT TGGCAGCGTG CTGGCGTGCG AGAGTTTTCA CGCCGACGTG
CACGCGGGTA ATCTGATCGT TTGTCCGGAC GGGCGCGTTG GTTTCATCGA CTTCGGCATC
GTCGGCAAAA TTTCCCCGTC AATTTGGGGC GCGGTGCAAG CTTTTTTCCA ATCCACCGCC
GCGCGCGATT ACGAGCGCAT GGCGCTCGCG CTGGTGACGA TGGGGGCCAC CGACGGCGAA
GTCGACGTCA AGAAATTCGC TAATGATTTA CGCAAAGTCT ACGAAACCTT AGATTCCATC
GAACCGACTG TTCTTGTCGA TGAAGACACC TTCGACGGCA CCCCTCGCGC CGCCGTGACC
GTCGACCAAC AGCAAGTGAC GCAGTTGGCC TCCGACTTGA TCGTCGCCGC CGAGGAGAAC
AAAATCAAAC TCCCCAAAGC GTTCGGCATC TTGATCAAAC AACTGATTTA CTTTGACCGT
TACGTGCAGT TACTCGCACC CGACTTAGAG GTCATCGACG ACGATCGCGT G
 
Protein sequence
APVIVRRTFE ALGATYVKLG QFIASAPSVF PKEYVEEFQK CLDATEVTDF SIIKRTIEKD 
LGRSIDDVFA TIDPVPLASA SVAQVHRATL LGSGRDVVVK VLKPNVEDTL KADLSFVLIV
SKVLQFLNPE LSRTSLVDIV GDIRESMLEE TDFRKEAQNV DAFRRYLEDA ELTNIAKAPQ
VYKQFSGKRV MVMEYFSGVP LTDLEAIRSV STRDPEATLI NALNVWFGSV LACESFHADV
HAGNLIVCPD GRVGFIDFGI VGKISPSIWG AVQAFFQSTA ARDYERMALA LVTMGATDGE
VDVKKFANDL RKVYETLDSI EPTVLVDEDT FDGTPRAAVT VDQQQVTQLA SDLIVAAEEN
KIKLPKAFGI LIKQLIYFDR YVQLLAPDLE VIDDDRV