Gene OSTLU_31612 details


Gene Information

Locus tag: OSTLU_31612
Symbol:
ID: 5001696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 228498
End bp: 230386
Gene Length: 1889 bp
Protein Length: 571 aa
Translation table:
GC content: 58%
IMG OID: 640417117
Product: predicted protein
Protein accession: XP_001417957
Protein GI: 145346978
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG5384] U3 small nucleolar ribonucleoprotein component
TIGRFAM ID:
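
The Start bp, End bp and Gene Length fields agree with each other if both coordinates are treated as inclusive. The record does not state its coordinate convention, so this is an assumption. A minimal sketch of that check (the function name `feature_length` is hypothetical):

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both inclusive.

    Assumption: the record uses inclusive coordinates; this is inferred
    from the listed values, not stated in the record itself.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the Gene Information fields above.
print(feature_length(228498, 230386))  # 1889, matching "Gene Length"
```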


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.776736
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00187106
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GAAGACGAGG GCGAAAGCGA CGAAGGCGGC GGCGAAGACG AAAGCGAAGA CGACGACGAA 
GGACGGGACG ATCGACGCGC GAAGAGCGGT AAGAAATCGA GCGGAAGTGA TCGCACCGCG
CAGATCTTTC AAAAGCCTGG AATGTTTAGT TTGGACGAGA TGGAGAGCTT CATGGATCAA
GGAGACGTCG AGGAGGAGAA GCGCCGTCGC GCGAACGAAG AAGTAGACGG CGGCTCGGCG
GACGAAGAGG GCGAGTCAGA CGACGATATG CTCGACATAT ACGGAGACAT GAGCGACAGC
GACGAAGATG GGGACGAAAA CGAGAACTCT GACGAGGAGG ATGACCTTGA TGACGCGTTG
GCGTACACGG CGAAACTCGC CGGCGTGAGC GCGAGCGCGC GCAAGAAAAA ATCGACGAGG
AAAGCCACCA AGGGCAAGAA GGCGCAAGAT TTAATGTTTG AAGACTTTTT CGGCAAGCGC
CAGGGGCAGC CTCTCGGTGG GCGCAAGGGT GGCAAGCTCG GCGCTTCGAC AGACGCTGAA
CTGAACGAGT TGAGCGACGA GGAGGAGGAC ATGTTCAACG AACTTGAGGA CGGCGAAGAC
GACGACGAAG AGGATGACGA TTTGGATGGG GAGTTGGAGA CCGGTATCGC CGGTAGTCGT
GGAATCGAAA ACCATGACGA TGACGCGGAC GAGTACGACG ACGAGGAAGA AGAGGACGAA
GAAGACGAAC TCGTCGCTGG CGACGAAGAA GAGCTCGATG AGTTGGATAG AAAGCTGGAC
GCGGATTTGG ACGCCGAGCT TGCTCGTGCG GAGGCGGAGG GCGAAGACGG CGATAGCGAC
GCCATCGACG ACGACGAGAA GGAAGTGCCG CGGTCGGGTC CGAAGTCGGC GTTCCAGCGC
CAACAAGAAG CGCTCGGGCG CCAAATCGAA AAGCTCGAAG CCGCGGCCAT CGGCGAAAAG
TCTTGGCTTC TCAAGGGTGA AGCCGCAGCA AAAGAGCGGC CGATGAATAG CGCGTTGGAA
ACTGATCTCG AGTTCGAGCA CGTCATGGCA CCTGCGCCGG TGATCAGCGC AGAGATAACC
CAGAAGCTCG AGGAAATCAT CAAGCAGCGA ATCATCGAAG GCCGATTCGA CGACGTCGAG
CGTGTCGAAC CCGTGGAGGA GCGCGAACGC AAGGAGCTCC CACAACTCGA CGACACCAAG
TCTACCAAGG GTTTAGGTGA TATTTACGCC GATGAGTACA TGCGTCAAAA GGCTGGCGTC
GCGCTCGGCG AGAAGGAAGA CCCCATGGTT GCCGAGGTGA AGAAGCTCTG GGCCACGCTT
TCGTATCGCT TGGACGTGCT CTTGCAAACG GGCGAGGTGG AGGATCCGAA GGATCTCGAA
AAGAAGATCG ATCGGGAACT CGCCGCGCGT GCGAAGGGGA ATATTGTGCC GTTGACGTTC
GACGAGTCCA AACGTCTGGC GCCCGAGGAA GTCTTCGCGG GAGGCGAAGG CAAGGGCGGA
CAGCGCGGCT CCGCCGCTGG CGCCGTCAAG GCGGACGACG AACTCACCAA GGAGGAACGC
AAGGCTGGTC GTGCCAAGCG AAAACGGAAG TCCAAAGCCG CGCAAGAAGA GAAAGATCGC
GTCAAGGCCA AACGAGACCG CGCACGCGAG GCCCAGCACA AGGCAGAAGA AGATGCTGGT
TTCACGCGCA AGGCGCCGAA AGTTGCGATG CTCGCCGTCG GTTCGGCGGC TGGGAAATCC
AAATCCGATT TCTCCAAATC GAGCAAGGTT TTCGGCATGC TCCAAGATGC CAAAGACGCA
GACGCCGCGC GCGGCGGTGT GGCGAAGAAG AGCAAGTCAG ACTCCGCGAA GAACAAGCCA
TCTCTCAAAC TTTGAGTATA GTATTGTAT
 
Protein sequence
MESFMDQGDV EEEKRRRANE EVDGGSADEE GESDDDMLDI YGDMSDSDED GDENENSDEE 
DDLDDALAYT AKLAGVSASA RKKKSTRKAT KGKKAQDLMF EDFFGKRQGQ PLGGRKGGKL
GASTDAELNE LSDEEEDMFN ELEDGEDDDE EDDDLDGELE TGIAGSRGIE NHDDDADEYD
DEEEEDEEDE LVAGDEEELD ELDRKLDADL DAELARAEAE GEDGDSDAID DDEKEVPRSG
PKSAFQRQQE ALGRQIEKLE AAAIGEKSWL LKGEAAAKER PMNSALETDL EFEHVMAPAP
VISAEITQKL EEIIKQRIIE GRFDDVERVE PVEERERKEL PQLDDTKSTK GLGDIYADEY
MRQKAGVALG EKEDPMVAEV KKLWATLSYR LDVLLQTGEV EDPKDLEKKI DRELAARAKG
NIVPLTFDES KRLAPEEVFA GGEGKGGQRG SAAGAVKADD ELTKEERKAG RAKRKRKSKA
AQEEKDRVKA KRDRAREAQH KAEEDAGFTR KAPKVAMLAV GSAAGKSKSD FSKSSKVFGM
LQDAKDADAA RGGVAKKSKS DSAKNKPSLK L
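
Both sequences above are printed in space-separated blocks of ten residues, six blocks per line. Before their lengths can be compared with the Gene Length and Protein Length fields, or the GC content recomputed, that layout has to be stripped. A minimal sketch follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any tool referenced by this record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first 80 bases of the gene sequence, copied in their printed layout.
block = """GAAGACGAGG GCGAAAGCGA CGAAGGCGGC GGCGAAGACG AAAGCGAAGA CGACGACGAA
GGACGGGACG ATCGACGCGC"""

seq = clean_sequence(block)
print(len(seq))          # 80 bases once the spacing is removed
print(gc_fraction(seq))  # GC fraction of this excerpt only, not of the whole gene
```

Passing the full gene or protein text through `clean_sequence` and taking `len` of the result gives a value that can be compared directly with the lengths listed under Gene Information.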