Gene OSTLU_26531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26531 
Symbol 
ID5004384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp395461 
End bp398985 
Gene Length3525 bp 
Protein Length1174 aa 
Translation table 
GC content63% 
IMG OID640419805 
Productpredicted protein 
Protein accessionXP_001420497 
Protein GI145352319 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.297969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.104037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGAGC GCGCGACGAC GCCGACGCGA CGGGAGGAGG CGGCGTTCGC GACGACGCCG 
ACGACGCCGC GCGCGCGCGA GGACGCGCCG ACGGGGTTCG TGGAAGTCGC GAACGGCGAC
TTGGGGGCGC TGTTCGGGGG GTGCGAGGGC GACGACGATT GGTTGACGCC GACGGGGGGG
GGGGGGGAGG TCGGCGCGTC GGCGTCGGCG TCGGCGGGGG CGGAGGCGTC GGCGGGGGCG
GTTCGCGCGG CGGCGACGGC GACGGCGGCG GCGACGTACG CGACGAACGC GCCGTTCGCG
TCGCCCGCGG CGTCGCGGGA CGATGGGATG GGATTCTTTG ACGATTTAGA CGCGGCGGAG
GAGACGGTGA GCGCGCCGGC GGTGATTCCG TTCGAGGCGC GGGCGAGCGG GGCGGCGGAG
GCGCCGGCGC GCGAAACGTA CGCGGCGAGC GAGAACGTGG CGGAGGCGAA CGAGATCGCG
AGCGTCGCGG CCGAGGCGCG CGCGCCAGTT ATCGAGGCGA CGAGCTTCGC GCCGACGGAT
TCGGCGCCGG ATCGTGCGTT TTACGAGGAA ACGCACGCGC CGGCGGCTGA GTATCCGCAA
GCAGAGTTCG TCGGCGAGCG CGAAACGAGC GATGACTTCG GGATGGGGAT GACGTCACAC
GCTCACGCCA CTGCGTACGA GCAACATGCT CCGGAACAGT ACGCGCGACA GGGATTTTCC
CAACCGCTCG CGAGCGCTTC GCACGAAGAC CAGCCGAATC GCGCGCTTGA ACTGCGAGAG
CCGCCTAGAT CAGCGTACGC AGACTATCGA GAAAACGACG TCACCGCGGA CCTGAGCGCA
CCACCGATAG TGCCGGCGTA TTCTTCCGAG AATTTGCACA GCCGTGTGTC CTCGTCGAAT
TCCGTCTCCG ATATGCAGGC GCCGCCGCAG ACGCCGCCGA AGCCGACGTT CATGGTGCCG
ACGCCGATTG AAGACTCCCC GATGGCGTAC GTGCATGCCG TGCCGCATGC ATTGGCTTCA
GCGTCGCCCG TACCATTTAC GCCACCACCT CGCGCGAGCG CGGGCTACGA CACGTACGAG
CCCGCTCGCG TGGCACAGCC AGAGTATCCG CCTGTGCCTG CGCCGAGCGC GAATCATGGC
GGCTATGCGC CGTCTTACGA GGCGCACTCC ACGCAACAGC TCGAGTATGC AGCATATAGT
GAAACAGAGA CGCTCAAACC CGCAAATTAT GACGACACTG ATCGTTCTCC GCACGGCCGC
CCGCAGCACG TGGCGATGAG TTTCGGTTTT GGCGGCTCGC TCATACTGAG CGGGCCTGGG
TATCCTGGCG GTAGAATCAG TCACGGAACG AGCATACCAC CTTGTAGCTT GCGCGTGCAT
TCGGTTGGAT CCATGTTAAA GGATGGTAAC ACACTCGGTA TGTCTTACGT ACGATCGATG
GAGGCGTTCG ACGGGCCGTT GGGAAATCGA CGACAAGCCG ACGTGACAAA GATGATGGAC
TCGGCGCTCT CGTCCGGGGG AGAACGGCAA CAGAGCGAAG TGACATTATA TCGCGTGTTA
CAGACGATGT TACGACATAA GGGTGAGATT TCAACCCCTG GTGATTTGTT AGGCGAAAAT
AGAAAAGGCG CTGTCGCCGA GCTTGCATCC GTGCTCGCCG GTGACGCTCA GGCAGCCTCG
GACGGTGGTT GGGTGTCGGC AAACGTCGCG GCGTCGCCGC TGAATCCGTC GGGCGAAGGC
GACGCGCAAC AAATCGTGCA AATAGAGAAT CTCCTCATCG CCGGTCGGCG CGGTGAGGCG
CTGCAGGCCG CTGTAGCGGC GAAATTATGG CCACACGCGC TTCTTCTCGC CAGCCACATG
GGCGGTCGTC ACTATCACGA AACGGTCTCC ATCATGGCAA AGAGCGTGTG CCGAGTCGGT
TCGCCGCTCC ACACGCTCGA AGTAGTCATG GCTGGGATAC CCCAAGAGCT CACGACAAGC
GGTGTCGAGG CTGCACCGAA CGTGCACGGG ATGCAAGTGC CGGAAGTTTC CCAAATTCGA
GAACTTCTCC CGAGATGGCG CGAGCACATC GCTATTCTTT GTTCAAACCC AGCAAAAGGG
AGTGATTTCG TGCTCAAAGC GCTCGGCGAC GAGCTCTGGT GTCAGAATGA CATCACGGCC
GCGCATGTTG CCTACGCACT GTCAAAGCAA CGTCCGACGC CGTATTCATT CAATTCGCGG
TTGTGTCTCA TCGGCGCGGA TCATCGCAAG TTCCCGAGGA CGTACGTCAC GCCTCGTGCC
GTGCACCTCA CCGAGATTTT CGAACTCGCG GTGTTAGGTT CGAATCCACA AGCGCAACTT
CCATCTCTGT TGCCTTACAA GCTCTTATAC GCCGGAGCTT TAGCTGAGGT TGGTAAATTG
AAACCAGCGC TGGCGTACGT AGAGTCTGTG TTGAAGAGCG TTCGTTCTCT GGATAGAAAT
TCTCCCGAAG TGAACGGCGC CCTCGTGGGA ATGTTGGCGG CGCAGATGGA AGATCGGTTA
CACAATAGTT TACGCGGAAA AACTGGCAGA TTAGCTGACG CCGCAGCTGG CGCTGCCAAG
GTGTTGGTGA GCGGCGTCAA AGGCTTGCTC GATCGAAGCG TCAGCTCCCT ATTCGGCGAC
GGCGGTGAAT TTCAAGCCTC GCCGCTGGGT CCACCGCACG AGCCGCGACC GCACACGCCA
CCGGATGCGT ATCAGCAGCA GCCCGTGCAA ATGCACCACA CACTTTCGTC GGCGCACTCA
CAAGCTGCAC CGGTCGTGCA ACCGCCGCCG GCGCACGTGG CGCGCCACGA GCGCACGCCC
TCGGGCAACT TGTTACGATC GATGTCGTCC CTCTTCGGCG GCGTTGCGCC AAAGCCTCAG
CCCGCGAACG AGCCGACAAT GAGCCAAGAG AATGTCTTCT ATTACGACGA CGAGCGCAAG
ATGTGGCTCG AGAGAGGACG AGCACCACCG AAAGAGGCCC CACCCGTGGG TGCTCCGCCG
CTTCGATCAG AACAAAGCGC GGCGAGTGAA ATTGCAGGTC CGCCTCCAGT CATGGCGCCA
AGCACGCACG CAAAACAGCA AGGTGGCGTA CACACGCGTT ATGTGTCGAC GTTCAGCACG
ACTTCGACGC CGACTGTCGC CCCTCAAGGC TTCGTCCCCG TCGCGCCTAA CGCCGGCGCG
TGCGCGCAAG CTCCTGCGCA ATTCTTCATG CCGTCCGCCG TCGCGCCGGC CCATTCCGCG
CACTCCCGGA ATGAATCCAG CGAGAGTCAA TCGTCGGCGT CGTACGCCGC ACACGAACGA
ACCGCCTCGC AAGATGGATT TTACGGTTAC GAAGCCTCGG CATCCGAATC GTCGCATCCG
CCGCCCGCCG GCGCAATCCC TCGCCCGCCC ACCATCGACC CCGCCCTCCT CGCCCCGTCG
TTCGCGACGT CAGCACCACT CCCAGTGATT CATCACACGA GCGCTTCCGC CGCTCCATCC
GTCGACGTCG TCGCCGATGA TTTCACCGAC CTCAGACTGC AATAA
 
Protein sequence
MGERATTPTR REEAAFATTP TTPRAREDAP TGFVEVANGD LGALFGGCEG DDDWLTPTGG 
GGEVGASASA SAGAEASAGA VRAAATATAA ATYATNAPFA SPAASRDDGM GFFDDLDAAE
ETVSAPAVIP FEARASGAAE APARETYAAS ENVAEANEIA SVAAEARAPV IEATSFAPTD
SAPDRAFYEE THAPAAEYPQ AEFVGERETS DDFGMGMTSH AHATAYEQHA PEQYARQGFS
QPLASASHED QPNRALELRE PPRSAYADYR ENDVTADLSA PPIVPAYSSE NLHSRVSSSN
SVSDMQAPPQ TPPKPTFMVP TPIEDSPMAY VHAVPHALAS ASPVPFTPPP RASAGYDTYE
PARVAQPEYP PVPAPSANHG GYAPSYEAHS TQQLEYAAYS ETETLKPANY DDTDRSPHGR
PQHVAMSFGF GGSLILSGPG YPGGRISHGT SIPPCSLRVH SVGSMLKDGN TLGMSYVRSM
EAFDGPLGNR RQADVTKMMD SALSSGGERQ QSEVTLYRVL QTMLRHKGEI STPGDLLGEN
RKGAVAELAS VLAGDAQAAS DGGWVSANVA ASPLNPSGEG DAQQIVQIEN LLIAGRRGEA
LQAAVAAKLW PHALLLASHM GGRHYHETVS IMAKSVCRVG SPLHTLEVVM AGIPQELTTS
GVEAAPNVHG MQVPEVSQIR ELLPRWREHI AILCSNPAKG SDFVLKALGD ELWCQNDITA
AHVAYALSKQ RPTPYSFNSR LCLIGADHRK FPRTYVTPRA VHLTEIFELA VLGSNPQAQL
PSLLPYKLLY AGALAEVGKL KPALAYVESV LKSVRSLDRN SPEVNGALVG MLAAQMEDRL
HNSLRGKTGR LADAAAGAAK VLVSGVKGLL DRSVSSLFGD GGEFQASPLG PPHEPRPHTP
PDAYQQQPVQ MHHTLSSAHS QAAPVVQPPP AHVARHERTP SGNLLRSMSS LFGGVAPKPQ
PANEPTMSQE NVFYYDDERK MWLERGRAPP KEAPPVGAPP LRSEQSAASE IAGPPPVMAP
STHAKQQGGV HTRYVSTFST TSTPTVAPQG FVPVAPNAGA CAQAPAQFFM PSAVAPAHSA
HSRNESSESQ SSASYAAHER TASQDGFYGY EASASESSHP PPAGAIPRPP TIDPALLAPS
FATSAPLPVI HHTSASAAPS VDVVADDFTD LRLQ