Gene OSTLU_19135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19135 
Symbol 
ID5006671 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp510672 
End bp513769 
Gene Length3098 bp 
Protein Length980 aa 
Translation table 
GC content58% 
IMG OID640422092 
Productpredicted protein 
Protein accessionXP_001422613 
Protein GI145356799 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.439366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTGGC AGTGGCACGG CGGTTCGTTA GGTCGCGATG GAGCGCTGTA CGCGGTGCCG 
TGCAACGCGT CGTCCGTCCT GCGGGTGTGC ACGAAGACGG AGGAGGTGAG TTTCATCGGC
GGCGACGTGA TTTCGCCGAT GAAGAATAAG TGGTACGGCG GCATTCAAGC GCCGGATGGT
TCGATTTACG GCGTGCCGTA TTGTTCGGAT AAAATCATAC ACATCGTTCC AGAGACACAA
TCGGTGGAGA TGCTCGAACT ACAAGGAGCG ACTTTGGAGC CGAATAGCTA CGCTTGGCAC
GGCGGAATCT TAGCGCCAAA TGGCTGCATT TATTGCTTTC CCAGTCACGC TCGTCGCGCG
ATGAAGATTG ACTGCGCCAC GCGCACGTGC ACGCTCATCG GCGACGATCT CGGAGACAAG
CGGTACAAAT TTGGCGGCGG ATGCGTCGGC CCCGACGGCG ACTCGGTGTA CGCGTTTCCG
AGCGATTACA AGGCGGTGCT GAAAATCGAC ACGAGAACCG ACCAGACTTC TTTAGTGGGC
GAAGGACTAC CCGGAATGCT CCCAGACTTG CTTAACAAGT GGCAAAACGG TGTTCTCGCG
GGCGACGGTT ACATATACGG AATCCCGTGC GACGCACCGT CTGTGATCCA AATCGACCCC
GGGACGGATT CGGTGCACTT CCTCGGCAAC CTGGGCGACT TACCCGACAA ATACCAAGGC
GGTTTCTTAA ATCGCGACGA CGGCGTCGTG TACTGCATCC CCGAAAACGC TGAAAACGTC
ATGCGCATCT GCCCCGTCGG CAGCGACGCG CACCCACCGC CCCGCAGCGG CTTCGATCAC
GACGCTCGTC CCGAAAGTCG TCGCGCGGCG GCGTTGTAAC TAAAGTGTGG TTTCTGAGCC
ACTTCCCGTT ATCCCTTCGC GCGTCTCCGC AGACGCGCGC GCGCGACCGC ACGTGCGCGC
CGGCGCGTCG ACGCCGATCA CGCGCGCGCA CCATGCTCCC GTCCTGGCTC GCCGGGAGCT
CGAGCGATGA GTACAGCGGC GACGAAAGCG AAGGCCCGCG ATCGCCGTCG AGGTTGAAAC
AGATCGGTTC GACGGTGAAA CACGAGGCGA AGGACCGCCT GACGAAGCTG GCGCGAAGCG
GAACCGCGAG CACGGCGGTG GATGGACTCA CGGATTTCAT CGCGGACCCG AAATCGCACG
CGGCGGCGAC GAAGATTGGA AATTTGAAGG AAAAGTCCAA GACGCTGTGG ACGAACGCGA
TGCGGGACGA CGAGGCGAAG ATTGAAAGGA CGAAAAACGC GACGAAAAAG GCGCTGGAGA
TCACGGAAAA GTTATGCGTG AACGCGGATC CCGATGGTGA AGTCGCGCAC ACGTTTGCGG
ACGCGGCGAG ACGGATGAAG GATATCGTGG ATACGGGGAA AAACCGCGAA GAGGTGCTCG
AGCTGAGCAA GACGCTGGGG AAGGATGTGT GGGAACGGTT GAGCACGCGA GCGAAAGGAA
ACGAACACTT CACCGGTATG CGGGGAACCG TGGAAAGGAT CGTGAACAGG GTGAAGGGGC
TGATGCAAAA GTTGCAGGAT GAAAAGATGA AGCGAGACGA GGAGGCGGCG ATTTCCGCCG
TGTTAAACAG CGAGGCGGCG GTCGACGAGA TTGATTCGCG GCTTAAGACG TTGGCCGAGG
AGAGTAAAGA TGTGTGGAGT GATTTAAAGG CGGATACGCA ACTTAGAGCG TTGATTAAGG
AGGAAGTCGT GCCGGGCTTC GAGCGTTTGA TTCGCGGAGC GGTGCAGGTT TCGTGCGAGC
TCATGTCCAA ACTCGAACTC CCGCGCGTGG ACGGGGTGTA CGATTCCCCG ATCGGATCGG
TGTGTTATCA CGTAGATAAC GTGCACTTCA CCGAGTTCCA CGTGTCCAAA GAGTCGTTAC
GGGTGATCAA TCACATGGAT GAGGACGAAC TCGGCGCTGG CTTGAGCACG ACGGTGGAGG
TCAGAGACAT AAACACCGTC ATGCAAAATG TGGAGTTCGC GTATTGCGAA TTTCCGCGAA
ACTGGGGGGT TGTGGACGGT GAAGGATTAT GTACAGTCAC AGTGGACGGC GCGAGCGTGG
GAATTTCGTA CGAAATCATC ATCAACACGA ATCAGTTGAT GAAACTCGTG AACCAAGGAG
TCGAGTTGGC CAAGGACGAC GGGAAAATTG CGGAGATGAG AGAGAAGATT GAAGCCAAGA
TGAAGGAACG TAAAGAGGCG AAAGAGCGAG AGGCGGCGCG GGCGCAGGCG GCGAAACCGG
CGGCGACAAC ACCCGTTAAA ATCGAGAAGT GCGCGTCAAT CGATTCGGAC CGCGAAGAGT
TCGGTTCACC CACGGGAGAG GTTGATGCGC TAGGTTCCGC GCTCGATCGC GCGTTTGGTG
GTGGTATGTT CAGCGACGCG CACGACGATG TAGACTCGCC GCCGTTATCG CCGACGTCGC
CGACGTTTCA CGACGCCACG GACGACGCTC GTATCTTGGG GGCCAAGGAA AAGCTGATTC
GCAATCGCAA GGTGGTGAGT GAAATCTTAG GTGACGACTT TTTGGGCGAA GAACCCGTGT
TAGAGCTTCG CGTACACACG ACGCACATCT CAGTCGGCGA GCTGGACGTG CAAATTAGCG
GCACGTCCGC GGCGTGGTTG TACAACATGA TTGCTCTCGT CCTGACGCAA CAGCTTCGCG
GAACGATTGA GGAGAAAATC AACAACATAA CGGTCAAACA GCTCGCGCGA GTGAGCGGTG
CCGTTTCCGC GTACAGCGCT GGTCTAATTG AGGTTTCCAT CTACCAAGAC GACGAATCGG
ACGACGACAT CGGCTCGATG TTATCCGGCG TCACGGGCTC GCTTCGCGAG AATCCGGGCG
TCTGGGGCGA AGACTGGAGA TGTTCGCACT GCCCAGGTGA AGCCCCCGAA CACGTCGAAG
CGCGACGCAA GTTTGGTTCT CGATCGCATT CAAAAGCCTC TTTCGAGGGT TTGGACGAAA
TCGTCGAAGA AGCTGACGCG ATCGAGCGCG CGGAGGCCGC GCGCCGGTCG CTCGAAGACG
AGGACGTCGA CTGGCACGAC ATCGCGAGTC CGATGTGA
 
Protein sequence
MKWQWHGGSL GRDGALYAVP CNASSVLRVC TKTEEVSFIG GDVISPMKNK WYGGIQAPDG 
SIYGVPYCSD KIIHIVPETQ SVEMLELQGA TLEPNSYAWH GGILAPNGCI YCFPSHARRA
MKIDCATRTC TLIGDDLGDK RYKFGGGCVG PDGDSVYAFP SDYKAVLKID TRTDQTSLVG
EGLPGMLPDL LNKWQNGVLA GDGYIYGIPC DAPSVIQIDP GTDSVHFLGN LGDLPDKYQG
GFLNRDDGVV YCIPENAENT RARDRTCAPA RRRRSRARTM LPSWLAGSSS DEYSGDESEG
PRSPSRLKQI GSTVKHEAKD RLTKLARSGT ASTAVDGLTD FIADPKSHAA ATKIGNLKEK
SKTLWTNAMR DDEAKIERTK NATKKALEIT EKLCVNADPD GEVAHTFADA ARRMKDIVDT
GKNREEVLEL SKTLGKDVWE RLSTRAKGNE HFTGMRGTVE RIVNRVKGLM QKLQDEKMKR
DEEAAISAVL NSEAAVDEID SRLKTLAEES KDVWSDLKAD TQLRALIKEE VVPGFERLIR
GAVQVSCELM SKLELPRVDG VYDSPIGSVC YHVDNVHFTE FHVSKESLRV INHMDEDELG
AGLSTTVEVR DINTVMQNVE FAYCEFPRNW GVVDGEGLCT VTVDGASVGI SYEIIINTNQ
LMKLVNQGVE LAKDDGKIAE MREKIEAKMK ERKEAKEREA ARAQAAKPAA TTPVKIEKCA
SIDSDREEFG SPTGEVDALG SALDRAFGGG MFSDAHDDVD SPPLSPTSPT FHDATDDARI
LGAKEKLIRN RKVVSEILGD DFLGEEPVLE LRVHTTHISV GELDVQISGT SAAWLYNMIA
LVLTQQLRGT IEEKINNITV KQLARVSGAV SAYSAGLIEV SIYQDDESDD DIGSMLSGVT
GSLRENPGVW GEDWRCSHCP GEAPEHVEAR RKFGSRSHSK ASFEGLDEIV EEADAIERAE
AARRSLEDED VDWHDIASPM