Gene OSTLU_41924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41924 
Symbol 
ID5005216 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp469318 
End bp472413 
Gene Length3096 bp 
Protein Length979 aa 
Translation table 
GC content61% 
IMG OID640420637 
Productpredicted protein 
Protein accessionXP_001421159 
Protein GI145353732 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.123239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.112463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCGA GCGACGCGGA GGCGGTGCGA ACGCTGGCGT GCGTGACGGT GAAGCGAAGG 
TGCACGCCGA GGGCGTTCGC GTCGAGGTTG ACGCGCGGCG AGAGGGACGA GGCGAAGCGA
GCGCTTCTGG ATCGAGCGAT GACGGCGGAG AGCAAGGCGC TGAGGAATGC GGTGCTGGAC
GTTATCGCGA AAATCGCGCG TTGGACGGTG CCGCAGGGGG AGTGGAACGA GTTGTTGGAA
TTTTTGGGAC AGTGCGCGAG CTCGCCCGAG ACGGCGCATC GAGCGTTGGC GTTTAAGTTG
TTCGAGAGCC TGACGGAGAC GATCGTGAGC TCGCTGAGTC ATCACTTTAA GACGTTGGCG
GGATTGTTTG CGAACGGACT CGTGGACGCG CACGATGAGG TGCGGGTGAG CGCGCTTCGC
GCCGTCGGGG CGTTGGTGGC GAACGCGTCG GGCGAGCCCG AGGAGGTGGC GGTGATAAAG
TCGTTGGTGC CGCACGTGCT CGAGGCGGCG AAGACGGCGG TGTCGAACGA AGACGAAGAG
TCGGCATCGA TCGTGTTTGA GGTCCTAGAT GCGCTCACGG AGAGTCGCAC GAGCGCTTTG
AGTGGACACG TGCCCGCCGT CGTCGGCTTT TGCATTCAAG TCGCCACGGC GGAACGCGAG
CTCGGGACGA GCGCGCGACG ACGCGCGTTA GACGTGCTGG CGTACATGGC GCGTCACAAA
CCAAAGGCAC TGACAAAGTC TAAGCTTGTC GAGCCGATGC TGGCCGTGTT GTGCCCGCTG
TGTGGTGAGC CCAAGGAAGC CGAGCTCGCG GGCGAGGACG ATCTCGAAGA CGAAGACGAG
GTGCACATAC AAACCGTAGC GAGTCAGCTC ATTGATATTT TAGCGCTTAA AGTGCCGGCA
AAGTATGTCC TTCCGACGGT TCTGTCATTC GCCGCGGCGA ATATCAACAA TGCATCGAAC
GACCGCTTGC GTCACGCCGC AGTCGCCGTG CTCGGCGTCG TCACCGAAGG GTGCGCCGAG
GGCGTGCGCG CGCACGCGAG CACCATCGTG CCGAGTGTGG TCGGACGCTT GAGCGATCTA
AATGGACCCG TGCGAGGCGC GGCGGCGTTT ACGCTCGGAC AGTTTGCCGA GCACCTTGGG
TTGACGTTGG AAGACCCGGA CATGCACAAG CAAGTGCTGC CGAGCTTATT CACCGCGCTT
CCGGTTGAGC AAGTGAAGAG CGTGCAAGAG CGCATGATGT ATGCAATGGA TGCGTGGTTG
GAAGACGTTC AAGACGAAGT CGGCGTGTAC GTCAAACCTT TGCTCGACAT AGTCTTATTA
GCTCTCGATA GCGGTGCCAA GCGCCACGTG CGCGAGATGT TACTTTCGGC GCTCGCGTCC
GCGACGGCGT CGAGCGGGGA CAAGGTGCAT CCGTACTTGG GCGAACTCTT GCCTAGACTC
GATCGTTGCT TGTCTCTGAC GGCGGATGAA GAATTGAACG TTCGCGCGCG CGCGTTAGAA
GTGCTGGGGA TGTTGATTTC GGCCGAAGGT GGCAAGGAGG CCATGGGACC GCACGTGGAA
AACGCCATGC AAGCCGGGCT CTCTGGGTTC GAGCTCGATT TTGCCGAGCT CCGGGAATAC
GCCCACGGCT TGTTTGGCGA AGTGGCGGAG GCGCTCAAGG AAGATTTCGA TCGCTACCTC
GCCGTGTGCG CGCAGAAAGC GTTCGCGTCG CTCGAACTGG ACGATGGTAT CATGTTTGAC
AGCGAGGACG AAGCCGATCG CGAAGAGTTG GATTCGGACG ACGACGGCGA CGGCGACGGT
GCCGATGGTA TGACGAGGAA GCCGGCGGGT TACTCCATTC GCTCCGGCGT CATGGACGAG
AAAGCGTCGG CGTGCAAAGC GCTGAATTGT TATGCTTCGC ATTGTCCTCG CGCATTCGCG
CCGTACATCG CCAAGGCGTC CGAGCTACTC GGAGGCATGA CGGATTATAT GCACGAGATG
GTGCGCGTGC AGGCGCACCT TGCGCTCGCG CAAACTACGA TCGCGGCACT CAGCATCAAC
CCCGAAGGCG CGAAAGAACT GGTGAACGAC TCACTCTCGG CCACGATTCG TTGTGTTTTA
GAAGACGAAG ATCGGGATGC GGTGGCCGCG TCCGTGGAAG CCGCGGCGCT TCTCGTAAAC
ATTCTCAAAG AACATCGCGG GGTAGACGTC TCGCAGCACG TCATCGATCT CACCGCGGCA
AGTTTGGAAA TTCTAGAGGG CAACACGTTC TGTCAAGTCG AAGATGGGTA TGACAGCGAA
GAAGGCGACG AAGAAGGCGA TGAAGACGAA GACGAAGACG TCGAAGCGGG CTTGGTTGTC
ATCGAAGCCG TCGCCGAGCT CTTACCTGCG CTCGCGATGT ACATGGGAGA GACGTTCGCG
ACGCACTTTG TGCCGCACTT CAACGCTTTG ATGAAGCGCA CGGAAGAAAA TCATACCGAA
ACCGAGCGTT CACTGTGCTA CGCGACGCTC GTCGAGGTGG TGCGCGCCGT CGGTGCGCCC
GCTGCGGGAT GCGCCGTCGT TGCGCTCCCG AGGTGTTTGC GTGACGTCGC GTCTCTGGAC
GTCGGTTTGA GGCGCAACAG TATTTATTGC ATCGGTATAC TGGCGCAAAT AGGTGGTGCG
AGCGCGATAG ATTTCCACGG CGCCATCGCG GAGGCACTCG CGCCGATGAC GCGAGCGGAC
CGAGAATCCG ACGGCGGCGT TCGCGATAAC GCCGTGGGCG CCATCGCGCG TTTGCTACAA
GTCATCGACG GCGGTCATGC TCGCGAAAAC GCATCCGCAC TGCTTGATGT CGTTCTCAAC
GCGCTACCCT TGCGAAACGA TTTAGAAGAA GGCCCGGACG TCTACCATTG GCTCGCGTCG
ACGATTACGG AAAATCCAAC CTCGCTCGCC GACGCCGCGA TGACTCGCAT CGTCGGTATT
TTAGCCGAAG TGGTCACCGA TGGTGCGCTC GCACCGATAG ACACGTCGCG AATATTAGGA
ATCGCGCTCT CGCGCGCCGA AGATCAGCGC GTGCGAGCTA CGCTCTCCAG TCTGCCCGCG
CAGAGTCAAG ACGCGATTCG TCGAGCCGGG GCGTAG
 
Protein sequence
MSASDAEAVR TLACVTVKRR CTPRAFASRL TRGERDEAKR ALLDRAMTAE SKALRNAVLD 
VIAKIARWTV PQGEWNELLE FLGQCASSPE TAHRALAFKL FESLTETIVS SLSHHFKTLA
GLFANGLVDA HDEVRVSALR AVGALVANAS GEPEEVAVIK SLVPHVLEAA KTAVSNEDEE
SASIVFEVLD ALTESRTSAL SGHVPAVVGF CIQVATAERE LGTSARRRAL DVLAYMARHK
PKALTKSKLV EPMLAVLCPL CGEPKEAELA GEDDLEDEDE VHIQTVASQL IDILALKVPA
KYVLPTVLSF AAANINNASN DRLRHAAVAV LGVVTEGCAE GVRAHASTIV PSVVGRLSDL
NGPVRGAAAF TLGQFAEHLG LTLEDPDMHK QVLPSLFTAL PVEQVKSVQE RMMYAMDAWL
EDVQDEVGVY VKPLLDIVLL ALDSGAKRHV REMLLSALAS ATASSGDKVH PYLGELLPRL
DRCLSLTADE ELNVRARALE VLGMLISAEG GKEAMGPHVE NAMQAGLSGF ELDFAELREY
AHGLFGEVAE ALKEDFDRYL AVCAQKAFAS LELDDGIMFD SEDEADREEL DSDDDGDGDG
MTDYMHEMVR VQAHLALAQT TIAALSINPE GAKELVNDSL SATIRCVLED EDRDAVAASV
EAAALLVNIL KEHRGVDVSQ HVIDLTAASL EILEGNTFCQ VEDGYDSEEG DEEGDEDEDE
DVEAGLVVIE AVAELLPALA MYMGETFATH FVPHFNALMK RTEENHTETE RSLCYATLVE
VVRAVGAPAA GCAVVALPRC LRDVASLDVG LRRNSIYCIG ILAQIGGASA IDFHGAIAEA
LAPMTRADRE SDGGVRDNAV GAIARLLQVI DGGHARENAS ALLDVVLNAL PLRNDLEEGP
DVYHWLASTI TENPTSLADA AMTRIVGILA EVVTDGALAP IDTSRILGIA LSRAEDQRVR
ATLSSLPAQS QDAIRRAGA