Gene OSTLU_94479 details


Gene Information

Locus tag: OSTLU_94479
Symbol:
ID: 5002174
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 254799
End bp: 256049
Gene Length: 1251 bp
Protein Length: 330 aa
Translation table:
GC content: 57%
IMG OID: 640417595
Product: predicted protein
Protein accession: XP_001418195
Protein GI: 145347486
COG category:
COG ID:
TIGRFAM ID:
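
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene span holds only coding exons, one stop codon, and introns; neither assumption is stated in the record. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 254799
end_bp = 256049
gene_length_bp = 1251
protein_length_aa = 330

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Assumption: whatever remains inside the span is intronic, since the gene is spliced.
non_coding_bp = gene_length_bp - coding_bp

print(span_bp == gene_length_bp)  # True
print(coding_bp)                  # 993
print(non_coding_bp)              # 258
```

Under these assumptions the span matches the listed gene length, and roughly 258 bp of the span is not translated, which is consistent with the gene being marked as spliced.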


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0456451
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGTG ATGAGAGCGA CGATAGCGTC GGGTTCTTAG GAGCTTACGA TGGGATGATG 
TCGGGGATGT TGGATCAACC TCGGACTTTC GTGAAGCTTG TCCCGGAGTG GGCCACGGCT
GGGCTGGGTT TTGAAGGCGG CGCCGACTCC AGCGACGGTG CGGAGGCGAA GAAGTTGAGC
CTCTTCGTCG CGGACGTCTT TCCATTGCTG CGTAAGTATC CAGACTGGGA TATGTCTGAT
ACGAACGCCA TTCGCTTTCT CACTAGACAC ATCAATGAAG AAAACTTCAG CGTCGATGAC
GTCGAGTTGT CTGATTTCAT GCGCGCCTCG AGCTCCACGG CGGCTCTTTC CAAGAGGGAA
AAGAGCACCA TCCGTGGTGT CTTACCTCCC AAAGACGTCA ATCGACGCGT GCCACTTTCA
CCGCACGCTT CATCCGATGA GACGCACGGG GTGCACCATT TCTTGACCAG ACTGCCGGCG
CTCGGCAAGG TGGAATTACG AGCGAAAGAC AACAGGCTTC CCACGAGCTT TGATGCGCGC
GTTGCGTACC CGAAATGCTC TCGACTTTTA GGCGCGGTCA GAGACCAGGG TAGATGTGGG
TCCTGCTGGG CCGTTGCGGC GACGGAAGTC ATGAATGATC GTCTCTGCGT CGCCACTGAC
GGCGAAAACG CGGACGAGCT CTCGCCCCAA TACGCCCTCT CGTGCTTCGA CAGCGGCTCA
GGTTGTGACG GTGGCGACGT CTTAGACACT TTACGGATCG CATTCACCAA GGGTATACCA
TACGGTGGTA TGTTAGACTC GAACGCGTGC TTACCGTATG AGTTTGAGGC GTGCGATCAT
CCATGCATGG TTGCTGGCAC GACGCCGCAG AGCTGTCCAG CGAAGTGCGC GGACGGTTCC
GCCTTGTCCT TTGTTCACCC CACATCCGAG CCATACACGT GCCCAAAAGG AGACGTCACG
TGTATTGCGA GAGAGATAAT GGAAAACGGT TCCGTGGCTG TGACTTTTGG ACCCGTCTAC
GCCGACTTCT ACAGGCACAC GGGCTCGGGT GTGTACACGG TCCCCAACGA CGCTGGCGAG
CCACTCGGAC AGCACGCGAC GAAGCTCATC GGTTGGGGGG TCTCGGAAGA GGGAGAACAT
TACTGGTGGA TGGTGAACTC GTGGAGAAAC TGGGGTGAAA ACGGCGTGAG CAAAGTGCGC
ATGGGTGAGA TGAATATCGA GTCGGGCATT GCCGCGATCG CCATGAAGTG A
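
The gene sequence is printed in space-separated groups of ten bases, so it needs cleaning before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for grouping and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAACGTG ATGAGAGCGA CGATAGCGTC GGGTTCTTAG GAGCTTACGA TGGGATGATG"
seq = clean_sequence(first_line)

print(len(seq))                        # 60
print(round(gc_fraction(seq) * 100))   # GC percentage of this line only
```

Passing all of the sequence lines to `clean_sequence` instead of the first line gives values that can be compared directly with the record's fields.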
 
Protein sequence
MKRDESDDSV GFLGAYDGMM SGMLDQPRTF VKLVPEWATA GLGFEGGADS SDGAEAKKLS 
LFVADVFPLL HVNRRVPLSP HASSDETHGV HHFLTRLPAL GKVELRAKDN RLPTSFDARV
AYPKCSRLLG AVRDQGRCGS CWAVAATEVM NDRLCVATDG ENADELSPQY ALSCFDSGSG
CDGGDVLDTL RIAFTKGIPY GGMLDSNACL PYEFEACDHP CMVAGTTPQS CPAKCADGSA
LSFVHPTSEP YTCPKGDVTH TGSGVYTVPN DAGEPLGQHA TKLIGWGVSE EGEHYWWMVN
SWRNWGENGV SKVRMGEMNI ESGIAAIAMK
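
The start of the protein can be checked against the start of the gene sequence by translating the first codons. The record leaves the Translation table field empty, so this sketch assumes the standard genetic code. The `translate` helper is hypothetical. Because the gene is spliced, only the region before the first intron is expected to translate directly; translating the full gene sequence would not reproduce the protein.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not name a translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence and first 10 residues of the protein.
gene_start = "ATGAAACGTG ATGAGAGCGA CGATAGCGTC"
protein_start = "MKRDESDDSV"

print(translate(gene_start))                   # MKRDESDDSV
print(translate(gene_start) == protein_start)  # True
```

The helper raises `KeyError` on ambiguous bases such as N, which is acceptable for this record because the sequence contains only A, C, G, and T.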