Gene OSTLU_93490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_93490 
Symbol 
ID5004745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp19459 
End bp20916 
Gene Length1458 bp 
Protein Length485 aa 
Translation table 
GC content64% 
IMG OID640420166 
Productpredicted protein 
Protein accessionXP_001420562 
Protein GI145352459 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACCG TGAAACTCGA CGTCGCGACG CGCGCGTGGA ACGTCGACGG GAGCGCCGGC 
GCGCGCGCGG CGAGCGGGCT GTCGTGGATC GCGGAGGCAG AGCGCGTCAC GGGACGACGA
CGGGGACGGT GCGCGTACGA CGGATGCGAC GAGGAGGCGG AGCACGGCGG ACACGTCTGG
ATCGCGGGCG TGTCGACGCG AGGCGCCGAC GCGAAGTGCG CGTTGGTGCC GATATGCGCA
GATTGTAATT GGCCGGGCAA TACGCGACGG ATGCAGAACG GGGGGTCGTA TTTGCGCGCG
GGCGTGGTGG CGACGGCGGT GGAGATGACT CCGGAGATGC TGAACGCGCG GCGACGGTTC
GCGCGGGACG AAGGAGACGA GGACGACGGC GAGGACGAGG AAGACGGCGA GGACGAGGGA
GACGTGAGGG TGTGTGTTTC GTGCGGCACA GACATCTCGG GTATGCCGGC GAACCACACG
GTGTGTCTCG CGTGTTTTCG ACGCGGATCG CTGAGCGGCG ATGGTCGTTC GGATGCCCGT
CCTTGTGTTT CGTGCGGCAC AGACATCTCG GGTATGCCGG CGAACCACAC GGTGTGTCTC
GCGTGTTTTC GACGCGGATC GCTGAGCGGC GATGGTCGTT CGGATGCCCG TCCTTGTGTT
TCGTGCGGCA CAGACATCTC GGGTATGCCG GCGAACCACA CGGTGTGTCT CGCGTGTTTT
CGACGCGGAT CGCTGAGCGG CGATGGTCGT TCGGATGCCC GTCCTTGTGT TTCGTGCGGC
ACAGACATCT CGGGTATGCC GGCGAACCAT ACGGTGTGTC TCGCGTGTTT TCGACGCGGA
TCGCTGAGCG GCGATGGTCG TTCGGATGCC CGTCCTTGTG TTTCGTGCGG CACAGACATC
TCGGGTATGC CGGCGAACCA CACGGTGTGT CTCGCGTGTT TTCGACGCGG ATCGCTGAGC
GGCGATGGTC GTTCGGATGC TCGTCCTTGT GTTTCGTGCG GCACAGACAT CTCGGGTAGG
CCGGCGAACC ACACGGTGTG TCTCGCGTGT TTTCGACGCG GGTCGCTGAG CGGCGATGGT
CGTTCTGATG CCCGTCCTTG TGTTTCGTGC GGCACAGACA TCTCGGGTAT GCCGGCGAAC
CACACGGTGT GTCTCGCGTG TTTTCGACGC GGATCGCTGA GCGGCGATGG TCGTTCGGAT
GCCCGTCCTT GTGTTTCGTG CGGCACAGAC ATCTCGGGTA TGCCGGCGAA CCACACGGTG
TGTCTCGCGT GTTTTCGACG CGGATCGCTG AGCGGCGATG GTCGTTCGGA TGCCCGTCCT
TGTGTTTCGT GCGGCACAGA CATCTCGGGT ATGCCGGCGA ACCACACGGT GTGTTTCGCG
TGTTTTCGAC GCGGATCGTA CGACAGCGAG TACGACAGCG AGTACGACAG CGAGTACGAC
AGCGAGTACG ACAGCTAA
 
Protein sequence
METVKLDVAT RAWNVDGSAG ARAASGLSWI AEAERVTGRR RGRCAYDGCD EEAEHGGHVW 
IAGVSTRGAD AKCALVPICA DCNWPGNTRR MQNGGSYLRA GVVATAVEMT PEMLNARRRF
ARDEGDEDDG EDEEDGEDEG DVRVCVSCGT DISGMPANHT VCLACFRRGS LSGDGRSDAR
PCVSCGTDIS GMPANHTVCL ACFRRGSLSG DGRSDARPCV SCGTDISGMP ANHTVCLACF
RRGSLSGDGR SDARPCVSCG TDISGMPANH TVCLACFRRG SLSGDGRSDA RPCVSCGTDI
SGMPANHTVC LACFRRGSLS GDGRSDARPC VSCGTDISGR PANHTVCLAC FRRGSLSGDG
RSDARPCVSC GTDISGMPAN HTVCLACFRR GSLSGDGRSD ARPCVSCGTD ISGMPANHTV
CLACFRRGSL SGDGRSDARP CVSCGTDISG MPANHTVCFA CFRRGSYDSE YDSEYDSEYD
SEYDS