Gene OSTLU_17948 details


Gene Information

Locus tag: OSTLU_17948
Symbol: SDG3506a
ID: 5005051
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 516947
End bp: 518632
Gene Length: 1686 bp
Protein Length: 503 aa
Translation table:
GC content: 58%
IMG OID: 640420472
Product: predicted protein
Protein accession: XP_001421172
Protein GI: 145353759
COG category: [R] General function prediction only
COG ID: [COG2940] Proteins containing SET domain
TIGRFAM ID:
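
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (which matches the listed gene length) and that the gene span contains only coding sequence, a stop codon, and introns; neither assumption is stated on this page, and the function names are illustrative.

```python
def span_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa):
    """Bases needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_bp = span_length(516947, 518632)   # 1686, matches "Gene Length"
coding_bp = coding_length(503)          # 1512
print(gene_bp, coding_bp, gene_bp - coding_bp)  # 1686 1512 174
```

Under those assumptions, the 174 bp difference is the portion of the gene span that does not code for protein, which is consistent with the record marking the gene as spliced.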


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGA CAAAGTACGA ACGCGAAGTG GTTTGGGTCA AGGTGCGACG ACAGCGCGCG 
CTCGCACGAC CGCGCGCGAG TGAAAACGGT GATTTCCACG AGTTCAAAAA TCAACGACGG
GCGAGACCGA CGTTGAATCG GTGCTGAATC GATGCGACGC GTAGAGGCGA AAGACTGACC
GAGGGACGAC GACCGACTCT CGACGCGTCG GCGCAGTACG GGAAAGGGGA GCGGTTTTGG
CCCGCGCACG TGGTGGGACG AGACGCGGCG CGCGCGCGCG TGCCGGGCGC GGCGGCGGTC
TTTGAAAAGC ATCCAGGCGC GCGCGCGTTT CAGTACTTTG GCTACGGCGG GTATTACCAG
CTCGAGAAAC CGAACGCGCC GGCTGCGATA GCGTGGGCGG AAGGGTTAGC GCGACGGTTG
GATGAAAAAG GGGTGGCCAA GGCGCACAAG ACGGCTGTGA CAAACGCGCG GGCGTACGTA
GAGCGAGGCG ATTTGACGAG CGAAGTCGCG GCGGGCGCGC TTTGGTGGAG CATGCCGCTT
CCAGCACCGC GCGAAGAGCC GGCGAAGAGA ACTCACGATT CGGCGGGGAC GTTGTCTGAG
AAGAATAAGA AGCCGCGACG CGGCGACGCG GCCAAAGATA CATTCGACGA GCGTGAGGAA
GTCAAAGACG CCGGCGCCGA CGAGTTTGAG TTTCCTCGCA TGTCCAAAAC GTTCGTCGGC
GGCGAACGCA AGGAATACAC GCCCACGTTG AGCGTGCTTT CGCTCGGAAA ACCACCTCCT
TTTGAGCGCA TCCATCGCAG CGTTTTCGTC AGCAGACCGC CGCCGGTGAA ACTGCACAAG
TCTGAAACCG CGGTGTGCGA CTGCCATCCG CCGCCGTCGC GCGGCGACAG CGAGACGATT
CGCGACGGAT GCGGGCAAGA GTGCTTGAAT AGAAAATTGC GATTTAGTTG CGACAGCCGA
ACGTGTCCGT GTGGGGACGC GTGCAGTAAT CGCCCGTTGA GTCAGTTACC GGCGCCAAAG
ACGAAGATTA TTCGCACAGA AAACAGAGGT TGGGGATTGA CTTTGCAAGA GCCCGTGCGC
GCGGGAACCT TCATCGTTGA GTACGCGGGT GAGATTTTAG ACGAGCACGA ATGCGCCGAA
CGGCTTTGGT ACGACAAGCA GTCGGGGGAA GAGAACTTTT ACTTGATGGA AATATCCGCA
AACTACGTCA TCGACGCCAA GTTTAAGGGC TCGATCGCGA GATTTATCAA TAGCAGCTGT
CACCCAAACT GCGAAACGCA GCGGTGGGTC GACGCTTCGA CGAACGAGAC GAGAGTCGGT
ATCTTTGCCA CCGAAGACAT CGCGAGTGGG ACCGAGCTGA CGTACGATTA CAACTTTGCG
CACTTTGGCG ATGAAAAGGG GACGTCGTTC GTGTGCATGT GTGGGCATCC CAAGTGTCGA
GGCACGCTCG ACGCGGCGAA GACGTCGAAA AAGAATTTGC ATCGCCGACT TCGCGTGGAA
ATCATGGTGA ATGGGAAAGT CGTTAAGTCG CGCAAGAAGA AACAAAAAGT CAAGGCCACT
GTGGTGGACT ATGATGCCGC GAAGAATAGA TACAAAGTAC AAGTCGAAGG TGACGAGAAG
GAAACCTTCG CGTGGGTGCG TCTCGATGGC GAAGGCGCAG CGAAACACTC GTGGCTGAGC
AAATAG
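
The gene sequence above is printed in blocks of 10 bases, 60 per line. A minimal sketch for turning that layout back into one continuous string and computing its GC fraction follows; the helper names are hypothetical, and only the first line of the sequence is used here as sample input. Running the same functions over the full sequence gives a value that can be compared with the GC content reported in the Gene Information section.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGACGACGA CAAAGTACGA ACGCGAAGTG GTTTGGGTCA AGGTGCGACG ACAGCGCGCG"
seq = clean_sequence(first_line)
print(len(seq))                      # 60 bases on a full line
print(round(gc_fraction(seq), 2))    # GC fraction of this line only
```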
 
Protein sequence
MTTTKYEREV VWVKYGKGER FWPAHVVGRD AARARVPGAA AVFEKHPGAR AFQYFGYGGY 
YQLEKPNAPA AIAWAEGLAR RLDEKGVAKA HKTAVTNARA YVERGDLTSE VAAGALWWSM
PLPAPREEPA KRTHDSAGTL SEKNKKPRRG DAAKDTFDER EEVKDAGADE FEFPRMSKTF
VGGERKEYTP TLSVLSLGKP PPFERIHRSV FVSRPPPVKL HKSETAVCDC HPPPSRGDSE
TIRDGCGQEC LNRKLRFSCD SRTCPCGDAC SNRPLSQLPA PKTKIIRTEN RGWGLTLQEP
VRAGTFIVEY AGEILDEHEC AERLWYDKQS GEENFYLMEI SANYVIDAKF KGSIARFINS
SCHPNCETQR WVDASTNETR VGIFATEDIA SGTELTYDYN FAHFGDEKGT SFVCMCGHPK
CRGTLDAAKT SKKNLHRRLR VEIMVNGKVV KSRKKKQKVK ATVVDYDAAK NRYKVQVEGD
EKETFAWVRL DGEGAAKHSW LSK
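
Because the gene is marked as spliced, translating the gene sequence end to end is not expected to reproduce the protein sequence. The sketch below illustrates this with the start of the gene. It assumes the standard genetic code, since the Translation table field is empty on this page, and the names used are illustrative. Exon boundaries are not listed here, so the code only shows where direct translation stops agreeing with the protein.

```python
from itertools import product

# Standard genetic code (an assumption; the page leaves the table blank).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1; unknown codons become 'X'."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3   # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 42 bases of the gene sequence listed above.
print(translate("ATGACGACGA CAAAGTACGA ACGCGAAGTG GTTTGGGTCA AG"))
# MTTTKYEREVVWVK

# Adding the next codon (GTG) gives V, while the protein has Y at residue 15.
print(translate("ATGACGACGA CAAAGTACGA ACGCGAAGTG GTTTGGGTCA AGGTG"))
# MTTTKYEREVVWVKV
```

The first 14 residues match the protein sequence, and the mismatch at residue 15 is consistent with an intron beginning near that point in the gene sequence.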