Gene OSTLU_35576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_35576 
Symbol 
ID5002962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp223001 
End bp224149 
Gene Length1149 bp 
Protein Length382 aa 
Translation table 
GC content62% 
IMG OID640418383 
Productpredicted protein 
Protein accessionXP_001418643 
Protein GI145348413 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.98269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.38201 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGCT ACGTCAAGGG ACGCACGCTC GGTGAGGGCA CGTTCGGCGT CGTCCACGAG 
GCGCGCGTCG AGGCGACGGG CGAGCGCGTG GCGATCAAAA AGATTCGACT CGGGAAACTC
AAGGAGGGCG TCAACTTCAC GGCGATACGC GAGATCAAAC TGCTGCAGGA GATCGAGCAC
GAGCACGTCA TCGCGCTCGT CGACGTGTTC GCGCACAAGA AGAACCTGAA CCTGGTGTTC
GAGTTCTGCG GCGGGGACCT GGAGATGGTG ATCAGGGACA AGACGGCGCC GCTGGAGCGA
GGGGAGGTGA AGTCGTACGC GATGATGACG CTGCGAGCGG TGGCGCACTG TCACGAGAGA
TGGGTGCTGC ACAGAGATTT GAAACCGAAC AACCTGTTGA TCGCGCCGAA CGGGTGCTTG
AAGTTGGCGG ATTTTGGGTT GGCGCGGATA TTCGGGTCGC CGGATAGACG GTTCACGCAT
CAGGTGTTCG CGAGGTGGTA TCGCGCGCCG GAGTTGTTGT TGGGGTCGAA GACGTACGGA
CCGGGCGTGG ATATTTGGGC CGTGGGGTGT ATCATCGCGG AATTGATGCT CCGGCGGCCG
TTCTTCGCGG GATCGTCGGA TATCGATCAG TTGGGGAAGG TGTACGCGGC GCTAGGGACG
CCGACGGAGA CGAATTGGCC GGGGGTGTCG GCGCTACCGG ATTTCATCGA GTTTGTGTAC
GTGCCGCCGC CGAATCTTCG CGATACGTTC CCGAACGAAA CGGACGAGGC GCTGGATCTG
TTGCGGAAGA TGCTCGAGTA CGATCCGAAT AAGCGTATCA CCGCCGCGCA GGCTTTAGAG
CATCCGTACT TTCACACCAA GCCCGCGCCG ATTCCGTACG AACAGCTTCC GAAGCGGTTC
GTCGCGAAAG AAGCCGAGGC GAACGCGGCG GCGGCGGCGG CGGCGGCGGC GGCGGCGGGG
GATGAAGAGC CAGCTTCACC CGCGTCCGCA CGTCAGCCGA AGACTGGCGA GAAACGTAGA
CTAGAGGACA CCACCGACTC GACGGATCCA AACTTTCGCC CGAAGCTCGA CGAAGAGGAC
AGGGAATCTT TGCGAAAACG AAAAGGCGCG CTCGACGCCG CGTTCGCTGA CGTCGACGGA
GACGACTGA
 
Protein sequence
MDRYVKGRTL GEGTFGVVHE ARVEATGERV AIKKIRLGKL KEGVNFTAIR EIKLLQEIEH 
EHVIALVDVF AHKKNLNLVF EFCGGDLEMV IRDKTAPLER GEVKSYAMMT LRAVAHCHER
WVLHRDLKPN NLLIAPNGCL KLADFGLARI FGSPDRRFTH QVFARWYRAP ELLLGSKTYG
PGVDIWAVGC IIAELMLRRP FFAGSSDIDQ LGKVYAALGT PTETNWPGVS ALPDFIEFVY
VPPPNLRDTF PNETDEALDL LRKMLEYDPN KRITAAQALE HPYFHTKPAP IPYEQLPKRF
VAKEAEANAA AAAAAAAAAG DEEPASPASA RQPKTGEKRR LEDTTDSTDP NFRPKLDEED
RESLRKRKGA LDAAFADVDG DD