Gene OSTLU_26389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26389 
Symbol 
ID5004323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp137037 
End bp140204 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table 
GC content62% 
IMG OID640419744 
Productpredicted protein 
Protein accessionXP_001420425 
Protein GI145352162 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0807646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.155521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGCGC CGTCGCCGGG AGTGAGCGTG GGCGCGCTCG CGAGGTGCGG GAACCCGTTC 
GCGCACACGC CGGCGTGGCC GAAGACGCAG CGGACGACGC CGGAGACGGC GGCGGCGGCG
CGGCGCGCGA GAGGGAAGGG GTTCATGTAT TTGAGAGACG AGGACGACGA CGCGGACGCG
GTGACGCCGG TGAGTCGGCG ACGGGCGCCG GCGTCGGCGC CGGGACCGAG CGCGCGAGCG
ACGGCGATGG CGCAGAGCCC GTTCGTGATC ATGCAACGAC ACATGGCGGG ATCGGCGACG
CCGTACGCGA ATTTAGACGC GCCGGCGCCG GATTCGGTGA CGCTCACGAC GGCGAGTGGG
AAGAGCGTCA TCGCGCGCGC GAACGGCGCG ATTCGAACCG ACGAGCAACC TAGATTTGAT
GACGACGACG AGGCGTACGA AGACGCGTTC ACGTTTCGAG CCGCGATCGA GACGCCGGCG
ACGACGAGAG CGGAGAGCGG GGGCGTGTTT ACGTCGGCGG CGCCCAAGGG CTTGGTCTTT
CAAACCGGCG GTGGTGCGAG CATCGCGGTG TCTGAAGACG TAAAACGACG CGCGCGCGCG
ATGTTTGCGG ACGTTGATGA AAACGACCCG TCGTCGTCTG GTCGAGAGCG ACCGAGCGCC
GCCGCCGCAA CGCCGCCCGC GGCGAGCGGC GGCGGAGGTT TGGTGTTCCA GACCGCCGCG
GGCGCGACTT TAGAGCTGAG CGAGGCGGCG TTGAAGAAGT CGCGCGCTTT TCTCGCGGAA
GTCGGGAATG AAAACACGCC AGTGCGTGCG GCGTTCCCGC AGCCGTCGTT CAAGACGCCG
AAGTCGCTCC CGGCGGTTCG CAAGGCGCCA TCGAAGGCGC CCGGGTTCAC GCCGCCGATG
AAGAGCACGA CGGCTTTCAC GCCGCCGATG TCAATACTCG GCGCGCGAAC GGCGGCGAGT
GCACCGCGAC AAGCGAAGAG ATCGCGCCAC GATGGCACTG GTGCGTCCGT GGGTGTTGCC
GTGCACGACT TATTCGCCGC GCGCGTTCGC ATGGGCATGA GAGCGCCGCT CTGCACATTC
TTCAATAATT TGCTCCCGTT TCAAGTTCGC CCAGCGTTCG TCGACACGTG CGTCGCGACG
CTCAACGCCG ATACCGCGAA GTCATTGCGC TTGCCAAGCG TCGAACGCGG CCTGGTTGGT
TGGCGAGAGA TGCGCGAGTT GATGATTAAA GCCGGCGCGA GCGACGCGTC TCTGACGAAC
GAGTGGGTGG CGAATGCGTA CAAGTGGATC GTGTGGACAC AAGCATGCAT GGCGCGAGCG
TTTCCAGAAA AGTATGCTTT CGGCGTCTTG AGCGAATCCG CCGTGCTGCA GCGCATGTTG
TACAAGTATG AACGCGAAAT TAACCGCGCG GAGCGACCGC ACGTGAGGAG GATTTTAGAA
AAGGATGAAA ACCCAGGCGC GCCGGCAGTG TACGTCGTGA GCGCGATTCG ATCGATGACG
ACCGCTTCCG GTATCGGTAA TGTACCTACG ATGTCAGAAA TCGAAATTTC GGATGGATGG
TATAGCGTTC GCGCGCGGCT GGACGCGAAG CTTACGCGAG CGGTGCGCGA AGGACGCTTG
CGCGTCGGGT ACAAAATATT TGTTGTCGGT GCCGAGCTAC GAGGTGTCAC GGATGCGGTG
TCACCACTGT CGGACGATGC AGAGATGGCG TACGTATGCC TGCACGTGAA CGGCGCGCGG
CTCGCGCCGT GGGATGCGGC TTTGGGACGC GTCACGTACA ATCTCACGAT ACCCCTGCGT
TCAGTGGTGC CCGACGGCGG CGTCGTGCCG CGTATGTTGA TTCATGTCCG ACACGCGTAT
CCCATGATGC ATCAAGAACG TAGGGATGCA GACAAGAACG TATTGCGTTG CGAAATCGCT
GAGCGTCGAG CGCATGCTGA ATGGCAGCGC GCTCGTGACA GCGTCTTGCA CGAGTTGCAA
GACGCAATGC ACAATCGCAT CGGAGGTTGG GGCAGCGAAG TCGATCAAGA GCGAGCGATT
CGCGAGGCGC TTCAAGAGAA GAATCTATAC GATCGTCGTA CGAGCGCGGT TCTTCGTCTG
AACGTTGTCG GCTTCATGCC GTCGTCAAAG CACGAATCAT ATCGAGGGCC GATCACCCGC
GGATCGACGA GTGCGATTCT TACGATTTGG GACGCGGACG AAGCACTCGT CGATGCGGCG
CAGCCAGGAC AGTCGTTTGC GGTGACGGCA ATCAAACCAC GCGCGAACGC GTTTATCGAG
GGCGAGTTAA GTTTATCCAC CACGCGCTAC ACGCGTTGGA CCCCGATCCC GCAGGAAGAT
ATTGCCGCGC AAAAACTCGA GCACAGCGAG CATGAATGGC GCTGCTTGAG CGTGCGCGCA
GCCGTGGATT TGGCTGACTT GTCGCGATCG GGTCTAGTGC GCAAAGAATT CGACGCCGTG
GCGTGTACTT TACACTGCGG GCCTCCACGA TCGACACAAC GCGGTCGACT GTCGCAATGG
ATATTTTGCT TTGATTCATC CGTCCTCGAT TCGACGACTC CGAACGCGGC GCGTCCGCAC
CTTTTGGCGG TAGAGATTAC TGGATACGAC GATGACAGCT TTGTCAAGGC GGATGATTGG
ATGCCATCCA CGAAGTTATT CACCGAAGCG CGGCACGAGT GTGGACCGCC GCTCGTTTTG
CGAAATTTGG AGTTCCAACA TTACGACGCG GACAACGGCG TCTACGTCGC GCGCGTGCCG
ATCGAAAATG TTTCGGTGAC GTTAGCGGCG CAAACGCCAG GCATCCATCG CATTTCGGTT
GCGTCGCGCG ATTTGGAGCG ATTTTGTGGC TCGCGAGACG ATCGATCGCG CTCGATCATC
GCGTCGTTGA AGTCGCGCGC GAGAGCACTC ACGGGCGTGC CCGAAAACGC CGAGTCTTTG
GTGGACGAGA TGCCATGGGA CGAGGTTCAC GCTTCGCAAG AACCCGCGTC GCAATACGAT
CCGCCCAGAT TGAGCATGGA CGACGATTGG AACGATGAGA ACGCGCAATC GCTCGCCGAC
GAGGCCGTTC GTCTCACGCG AAGCGCGAGC AAAGCGCCCG TCGTCGTGCC AACCCCGACG
CCCCGCCGAA CGCCGCGGCG TTCTAGCCGA GGAAGCGCGA GTCGTTGA
 
Protein sequence
MPAPSPGVSV GALARCGNPF AHTPAWPKTQ RTTPETAAAA RRARGKGFMY LRDEDDDADA 
VTPVSRRRAP ASAPGPSARA TAMAQSPFVI MQRHMAGSAT PYANLDAPAP DSVTLTTASG
KSVIARANGA IRTDEQPRFD DDDEAYEDAF TFRAAIETPA TTRAESGGVF TSAAPKGLVF
QTGGGASIAV SEDVKRRARA MFADVDENDP SSSGRERPSA AAATPPAASG GGGLVFQTAA
GATLELSEAA LKKSRAFLAE VGNENTPVRA AFPQPSFKTP KSLPAVRKAP SKAPGFTPPM
KSTTAFTPPM SILGARTAAS APRQAKRSRH DGTGASVGVA VHDLFAARVR MGMRAPLCTF
FNNLLPFQVR PAFVDTCVAT LNADTAKSLR LPSVERGLVG WREMRELMIK AGASDASLTN
EWVANAYKWI VWTQACMARA FPEKYAFGVL SESAVLQRML YKYEREINRA ERPHVRRILE
KDENPGAPAV YVVSAIRSMT TASGIGNVPT MSEIEISDGW YSVRARLDAK LTRAVREGRL
RVGYKIFVVG AELRGVTDAV SPLSDDAEMA YVCLHVNGAR LAPWDAALGR VTYNLTIPLR
SVVPDGGVVP RMLIHVRHAY PMMHQERRDA DKNVLRCEIA ERRAHAEWQR ARDSVLHELQ
DAMHNRIGGW GSEVDQERAI REALQEKNLY DRRTSAVLRL NVVGFMPSSK HESYRGPITR
GSTSAILTIW DADEALVDAA QPGQSFAVTA IKPRANAFIE GELSLSTTRY TRWTPIPQED
IAAQKLEHSE HEWRCLSVRA AVDLADLSRS GLVRKEFDAV ACTLHCGPPR STQRGRLSQW
IFCFDSSVLD STTPNAARPH LLAVEITGYD DDSFVKADDW MPSTKLFTEA RHECGPPLVL
RNLEFQHYDA DNGVYVARVP IENVSVTLAA QTPGIHRISV ASRDLERFCG SRDDRSRSII
ASLKSRARAL TGVPENAESL VDEMPWDEVH ASQEPASQYD PPRLSMDDDW NDENAQSLAD
EAVRLTRSAS KAPVVVPTPT PRRTPRRSSR GSASR