Gene OSTLU_3554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_3554 
Symbol 
ID5004057 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp172860 
End bp174087 
Gene Length1228 bp 
Protein Length375 aa 
Translation table 
GC content57% 
IMG OID640419478 
Productpredicted protein 
Protein accessionXP_001419925 
Protein GI145351102 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.298587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.425378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GACGAGCGAG ACTTGAAGAC GCCGGATAAT TGGGTGAAGA GGCATCCGTG CGTGCGAGAC 
GCGAGGCGAC GCGAACGCGG GACGACGAGG ACGCGGCGAA CGACGACGGC GCGCGTGACT
GACGGTGGTT TGCGTCGATC GTGGTGATAG GAATTTGATT CGTCTGACCG GGAAACACCC
GTTTAACGTC GAGGCGTCGC TTCCGGAGTT GTATGATTAC GGATTCATCT CGCCGGTGAA
CTTGCACATC GTGCGGAATC ACGGCGCGGT GCCGAAGTGC GATTGGGACA CGCACAAGAT
TAACATTTGT GGCAACGTCC CGAAACCGTT CGAGATCGGT ATGGACGAGT TGGTGAAAAT
GCCGAGCGAC ACGTTCCCGT GCTTGGTGGT GTGCGCGGGG AACCGTCGGA AGGAACAAAA
CTTGGTCAAG TCGTCGATTG GGTTCTCGTG GGGGCCGTGC GCCATCGGAA ACACGTACTG
GACGGGCGTA CCGCTGCGAG TGTTGCTCAA CAGAGCGGGC ATTCATAAGC CCGGTCCGGG
TGCGCGATAC GTGTGCTTGG CTGGTCCGCA AAACGAATTG CCGAAGGACT ACCCCGATCA
AGACGGTGGT CCGGGATCGT ACGGCACATC CATCGACATG GAAACCGCGC TCGATCCGAC
GTGCGATGTC ATTGTGGCGT ACGAACAAAA CGGTGCCAAG CTTCACCCAG ACCATGGATT
CCCAGTGCGG GTGATCATTC CCGGTTACAT CGGTGGGCGC ATGATAAAGT TTTTGAAGGA
GATTAAAGTC ACCGACAGAG AGTCAAACAA CTTTTATCAC TTCAACGATA ACCGCGTGTT
ACCGCCGCCA GTTGACGCTG AGCGCGCGAC CGAGGAGGGA TGGTGGTTCA AGCCCGAGTT
CATCATCAAC CAACTCAACA TTAACGGTGC CATCGCGTAC CCGGCGCCCG AAGAAGTCAT
TCCCAAGTCG CAAAAGACGT ACGCCTTCAA AGGTTACGCC TACTCTGGCG GCGGTCGCAA
GGTCATTCGC GCCGAGCTTT CCTTCGATCA AGGCTTGAGC TGGGAATTGA GTGATATTCA
CACTCGCGAA GAACCGCGAT GGGCCGATTT CAGCTCCGGT GACAAGGCCA GGCACTGGTG
CTGGTGCATG TGGACTCTCG AAGTGCCGAT TGAGAAGCTC CTCGACAAAA AGTGCGTCGA
AGTGTGCTTC CGCGCCGTCG ATCAATCC
 
Protein sequence
DERDLKTPDN WVKRHPNLIR LTGKHPFNVE ASLPELYDYG FISPVNLHIV RNHGAVPKCD 
WDTHKINICG NVPKPFEIGM DELVKMPSDT FPCLVVCAGN RRKEQNLVKS SIGFSWGPCA
IGNTYWTGVP LRVLLNRAGI HKPGPGARYV CLAGPQNELP KDYPDQDGGP GSYGTSIDME
TALDPTCDVI VAYEQNGAKL HPDHGFPVRV IIPGYIGGRM IKFLKEIKVT DRESNNFYHF
NDNRVLPPPV DAERATEEGW WFKPEFIINQ LNINGAIAYP APEEVIPKSQ KTYAFKGYAY
SGGGRKVIRA ELSFDQGLSW ELSDIHTREE PRWADFSSGD KARHWCWCMW TLEVPIEKLL
DKKCVEVCFR AVDQS