Gene OSTLU_4231 details


Gene Information

Locus tag: OSTLU_4231
Symbol:
ID: 5003365
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 143545
End bp: 144600
Gene Length: 1056 bp
Protein Length: 322 aa
Translation table:
GC content: 62%
IMG OID: 640418786
Product: predicted protein
Protein accession: XP_001419293
Protein GI: 145349754
COG category: [Z] Cytoskeleton
COG ID: [COG5059] Kinesin-like protein
TIGRFAM ID:
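
The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions. That reading is an assumption, since the record does not state its coordinate convention. A minimal Python sketch of the arithmetic follows; the helper names are illustrative and not part of any database API.

```python
# Values are copied from the record above.
# Assumption: Start bp / End bp are 1-based and inclusive.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate interval."""
    return end_bp - start_bp + 1


def bases_outside_translation(gene_length_bp: int, protein_length_aa: int) -> int:
    """Bases in the gene span not accounted for by the listed residues."""
    return gene_length_bp - 3 * protein_length_aa


gene_length = span_length(143545, 144600)
extra = bases_outside_translation(gene_length, 322)
print(gene_length, extra)  # prints: 1056 90
```

The 90 bp difference between the gene span and three times the protein length is consistent with the record flagging the gene as spliced, although the record does not list intron coordinates.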


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.892014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.224356
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GGGAGCGTCC GCGAGAGCGT CGCGGGCAAG GCGGCGCTGA GTTTCGCGTA CGACCACGTG 
TGCGATCAGT CGTCGGCGCA GGAGGAGATT TTCGAACGCG TCGGACGCGA CGCGGTGGAC
GGCGTGGTGG AGGGGTATCA CGGGTGCGTG CTCGCGTACG GGCAGACGGG CGCGGGGAAG
ACGTACTCGA TGCAAGGCGT GGATTTAGAT CGCGACGACG ACGTCGGAGG GTTTGAGGGT
GACGATGATT GCGTCATGAT GTCGCCGGGC GAGGCGGGAG GGGCGGAGGG GGACGCGCTG
GACGCGCCGA ACGCGGGGTT GATTCCGCGC GCGCTGAAGC GATTGTTTGA GCGATGCGAG
TCGGCGAGGA ACGCGGCAAT CGAAGCGGGC GGGGCGTGCG AAATCGAAGT GAAGTGCTCG
TATTTGGAGA TTTATAACGA GACGTTGCGA GATTTGTTGA TGAATACCGA GCACGATGGA
CCGGCGCCGA ACGTGCGGGA AGACGCCAAG CGAGGCACGT TTGTGGAGAA TTTGCACGAG
GAGCGCGTGC ACGGGGCGGA GCAGACGTAC GAGACGTTTT TGCGCGGTGC GGCGAATCGT
AGGGTGGGTC AGACGAATAT GAATGCCGAT TCTTCGCGTT CGCACAGCGT GTTCACGATT
TCGGTGGAAT CGCGCACGAA GGCGCATCCC ACGGCGCCGA CGACAAAAAA GAGCGCGCTT
TTGCACTTGG TCGATCTCGC AGGGAGCGAG CGGCAGAAGA GCACGGACGC GGCAGGTGAA
CGTTTGAAAG AGGCGAGCGC GATTAATAAA TCGCTCAGCG CGCTCGGGAA CGTCATCAAA
GCCCTCGTGG ACGTGGCCGA CGGCAAGGAA CGACACGTGC CCTACCGCGA TTCCAAGTTG
ACGTTTTTGC TCAAGGACGC GCTCGGCGGA CGCGCGCGCT GCACGCTCCT CGCGTGCGTC
TCGCCGGCGC ATGTGAACGT GGAGGAGACA ATGTCTACGC TGAAATTCGC CCAGCGCGCC
AAGCTTGTGA AAGTCCGCGC AGTGGCGAAC GAAGAA
 
Protein sequence
GSVRESVAGK AALSFAYDHV CDQSSAQEEI FERVGRDAVD GVVEGYHGCV LAYGQTGAGK 
TYSMQGGDAL DAPNAGLIPR ALKRLFERCE SARNAAIEAG GACEIEVKCS YLEIYNETLR
DLLMNTEHDG PAPNVREDAK RGTFVENLHE ERVHGAEQTY ETFLRGAANR RVGQTNMNAD
SSRSHSVFTI SVESRTKAHP TAPTTKKSAL LHLVDLAGSE RQKSTDAAGE RLKEASAINK
SLSALGNVIK ALVDVADGKE RHVPYRDSKL TFLLKDALGG RARCTLLACV SPAHVNVEET
MSTLKFAQRA KLVKVRAVAN EE
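
The GC content field in the Gene Information section can be recomputed from the gene sequence block. The sketch below assumes the sequence is pasted as plain text in the 10-base groups shown above. The function name `gc_percent` is hypothetical, and how the source database rounds the value is not stated in the record.

```python
def gc_percent(seq_block: str) -> float:
    """Percentage of G and C bases in a whitespace-formatted nucleotide block."""
    # Drop the spaces and line breaks used for display formatting.
    bases = "".join(seq_block.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base groups of the gene sequence, used as a short example.
example = "GGGAGCGTCC GCGAGAGCGT"
print(gc_percent(example))  # prints: 75.0
```

Passing the full gene sequence block to the same function gives the percentage for the whole gene.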