Gene OSTLU_38355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38355 
Symbol 
ID5004224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp123680 
End bp124861 
Gene Length1182 bp 
Protein Length393 aa 
Translation table 
GC content51% 
IMG OID640419645 
Productpredicted protein 
Protein accessionXP_001420084 
Protein GI145351435 
COG category[Z] Cytoskeleton 
COG ID[COG5059] Kinesin-like protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.12228 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTCG CCAACATCGT GAATGATGAG CGAAGCACAT CTCAGCTCAT CGCTCGACTG 
CAAGAGGTGA AAGATCAGCT AGCCGCGCGC GAAGAAGAAC TTCTACATGC CATGGTGACC
CGCCGACACT TGCACAACAC CATTCAAGAG TTGAAGGGCA ACATTCGAGT GTTCTGCCGC
ATTCGACCGT CTTCCGAAGA CGAGAGTGCG TTTGACGATT CCAATCTCGC GATCGATCGC
AAGGGTGAAT TTGCCGGCCG ACGACTCGAA ATCACGCCTC CGGATGCGCC GAAAAAATAT
GATTTCACGT TCGATCGCGT TTTCGCAAAG AAAGACAGTC AAAAGCACGT TTTCGATGAA
GTTTCTTTGC TCGTGCAAAG CGCGTTGGAC GGTTACAAAG TTTGCATCTT CACTTACGGT
CAAACGGGTA GCGGTAAAAC GTACACGATG TTGGGCGGTA AGGGCGAAGA GCGTGGGTTG
ATTCCGAGGT CCATGGAACA AATTTTCGCC TCACAATCTT TGCTCGAGTC CAAAGGCTTG
AAAGTGTCCA TCACCGCCAC ACTGCTTGAG ATTTACAACG AAGACATTCG AGATTTGCTC
GCGTCGTCGC CGGGTGCGAA GATTGAGTAC AAGATTAAGC ACGACGACGA CGGAAACACT
CGCGTGACGA ACTTGTGCGA AGTTGAGGTT TTTTCTGCGG CGGAGGTTGA GTCATTGATG
CAACAAGCCA ACGCTGCTCG CGCTGTAGCG AAGACGAATA TGAACGATCG AAGCTCGCGA
TCGCACATGG TTATGCGACT TTGCTTGGAC GGCGTCAACG AAGCGGGTGA ACCAATCCAC
GGCGCGCTCA ATTTGGTTGA TTTGGCCGGG AGCGAACGAT TAAGCCGCAC GGGCGCCACG
GGTGATCGTC TCAAAGAAGC GCAAGCCATC AATAAATCTC TCTCAAGTCT TGGTGACGTC
ATCTTTGCCT TGGCGAGCAA GGAAAAGCAC ATTCCGTTCC GCAACTCAAA GTTGACGTAT
TTGCTCAAAA ATTCACTCGG CGGCGATTGT AAAACGTTGA TGTTGGTCAA CGTCTCGCCG
TCTCTAGAAA GCGCTCAGGA GACCATATGT TCTTTGCGAT TCGCGGCTAA GGTGAATTCC
TGTGCGCTGA AAAGCGCCCC GTCGTCGAAA ACGAAAAAAT GA
 
Protein sequence
MQLANIVNDE RSTSQLIARL QEVKDQLAAR EEELLHAMVT RRHLHNTIQE LKGNIRVFCR 
IRPSSEDESA FDDSNLAIDR KGEFAGRRLE ITPPDAPKKY DFTFDRVFAK KDSQKHVFDE
VSLLVQSALD GYKVCIFTYG QTGSGKTYTM LGGKGEERGL IPRSMEQIFA SQSLLESKGL
KVSITATLLE IYNEDIRDLL ASSPGAKIEY KIKHDDDGNT RVTNLCEVEV FSAAEVESLM
QQANAARAVA KTNMNDRSSR SHMVMRLCLD GVNEAGEPIH GALNLVDLAG SERLSRTGAT
GDRLKEAQAI NKSLSSLGDV IFALASKEKH IPFRNSKLTY LLKNSLGGDC KTLMLVNVSP
SLESAQETIC SLRFAAKVNS CALKSAPSSK TKK