Gene OSTLU_23818 details

Gene Information

Locus tag: OSTLU_23818
Symbol:
ID: 4999490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 3907
End bp: 6927
Gene Length: 3021 bp
Protein Length: 788 aa
Translation table:
GC content: 55%
IMG OID: 640414911
Product: predicted protein
Protein accession: XP_001415358
Protein GI: 145340491
COG category:
COG ID:
TIGRFAM ID:
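
The listed gene length agrees with the start and end coordinates if both endpoints are counted. A minimal sketch of that arithmetic, assuming the coordinates are inclusive (the function name `gene_length` is hypothetical and not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end are both inclusive.

    Assumption: the record uses inclusive coordinates, which is what
    makes the reported 3021 bp match the listed start and end.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the record: Start bp 3907, End bp 6927.
print(gene_length(3907, 6927))  # 3021
```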


Plasmid Coverage information

Num covering plasmid clones: 61
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTCGCGACGT CGCGAGCGGC GCAGCGATGG GTGGACGCCG ATGTTTTTGC TCGCGATCCT 
CGCCCGTGGG CGTCGGGGCG CCCGCGCCCG CGCGGCGGCG GCGGCGGTGC GCGCGGGCAT
TCGGTGCGCG GGTGCTGTTC GTGTTGATGC TCAGTGGGTG CGGGCGCGCG TCGGGTGCGT
CTGCGGCGCG CGGGCGGGCG GCGGCGGCGA CGGCGGAGAC GACGACGACG ACGACGCCCA
GGGTGCGGGC GAGCCGTGTC GACGTCTCGC ACCAGCGCTC CGATCTCGGT GATTTCATAA
GTGACTTGAG GGACAACATC AATCGCGTGG AAAATTTGTT CAACGCGGGT AAGAACGCGT
TCAATCAAGT TGGCGCGGCG TTCGACGACT TGAGCAGGGA CATACAATCC TTCCCAGGGA
AAGTTAATGG TGTGAAGAAT GAAGTTGGCA AGCTCGTCGG AACCATCGGT GATAAGGTTG
AAGGTTTCTT GGAAGAAGTG TACGGAAAGC TCTTTCCGAC AGACGCTGGT TCCATGGTGA
ACGATATCGC TGGTAAGTTT TCGCCAACGG CGTCTCTGGG AGAAGGTCGC GACGGACCGT
ATTTTGAAGA TACATCCGCG ATCGGTGGTC TCGTTCGAAG ACTCAATCAA GACGTTCTCA
GAGATGATCT TCGCCGCGTT CTGTATAACG ATTACGCCTC ACAACTTGGC GCCGCACATA
AACGCGCAGC CGAGCTCGGC TGCTCAACCT TGCCCGCAAA TACAACCGAC GGGGAGGCGG
CAAAACTTGG ATGTACTTCC ATCAAGAGTC TGAGCCAAGT ATGCTACGAC GCCCCGATGC
GCGATTTGTT TCCTGAATCG GCATTTTCAC AAGAATACGA GATGCCTTGG CCTAAGAAAT
TGGGAAGCTC TCCGTTTTCA CCCGCGAAAT TTGACGTTGG ATTTCCAGTC GCCTCGTGGG
AAACGTGCGC TGGCATAAAC AAGTTCAACA TTCCAACCAA AGTCGCCACA GATCTCATTA
ACGCGTTTCG ACTCTTCTTT GGCGCCTTGT TTGATGGCAT ACAGGAAGGC GCGACGGAGG
CGGCGACCGA GGCGAAGAAC GCAATCATGG TTCCAGTCAA CGCCATCGAC GCGCAAATTG
CAGTAATCAA CAACCACTTG AAAACAGCGC AGGACCACAT GAACAGTATT TCACGCGTGT
TCGGTGGAAG GCGTTTGCTT TCCGAAGAAG ATATCGCTGA ACATCACCAC GAACTTCGTC
GACACACGGA GGCTTTGGCT GCTTCCATCG CACATGCCGA TTTTGTGTAC AAATCACAAA
TGCAACGCAT CGAAGATCAA GTCTTGCTGG GTCTCGAACA AATCAGAGAA GTACTCATTC
TCGATCCTTT GCTCCAGGAT CCGAAGAAAA TGGCCAATTT CCCGGATTAC GGTGCCCTCT
TGGAAAGGCG CGCAAGAAAA ATGGGTGTTG AGCACGATAC CGCTGAGCTT GGCGATGCGG
TTGAGAAACT GCGATCTGCG CGACGAGCGC TCACGAGCGC AATGGCAGAA CTTAAAAACA
CCGATCTCGA TATCGGCGTC TCATCGGATC TCGAGCTCAC GCTCGAGGCG ACGCTCAAGG
GCGATGCTTT CGTACAAGGT GATTTCTTCG AGTCCATCAT GGAGAAGACC AAGAAGCCAA
TCGAGAATCC TACCGTGATA AGGAAACAAG TTATGGGCGC TTATGGCTTC TATGTCAACT
TGGCTTTTGG CGTTGGCTTC AAGTTGCCGT ACTTCGCAAA GGCTGACGCA GAGGCCAAGC
TCCGATATGG ACTTTCGATT CCTGATATGA AGATTGGAAT CAAGAGCGAA AAAGGACAGT
TCTCCGTCTA CTTTAACCCT CCGCAGCCGC ACCTCGACGA AGACGGGCTC GAAGCGTCTG
TATCGGCGCA CCTTCAAGTC GGCGCCGAGC TCGAGATACC AATGGTGCAA ATCGAGTTGT
GCTGGGCTGG AGCGATTTGC TCTGGTCCCG AAGTGTACTT CTCTCAAGGT GCTCAAGTCG
GTCTGGACAT GTTCGCAGCT GCCATGAACA CCCCACCACC GTGTTTCGAT GGTGAAACCA
CGCTTGCGAC GTATTTCACC GATTTTGACT ATCCAAGTAA GCCCGCTACG TGTTCGCTTA
CGGGTTCCGG TATGGCGGCG GGCGTTGGCG CGTACTATCA AGTACCAAAG CCTGATGCGC
TCGTCAAACT CGTCACGACG ATCAGTTCGC CGTGTGAAGT TGACGTCCCG GATGTCGTTC
TTTACAAATC CAGCAAGGAG GACGGCTTTT TCGCTCAAGG GGAAATTTTC CCACCTCAAT
GCGGCGCTGA CATCGAGGCT GGCTCGGAAC CCCCGCCAGA TAAGTGTGGG TAGAGTCAAA
CGAGAAACCT CTTTTTCATC CAACTCACCG GCCTCGCGTA GCAGCTTGCC CCGAGTAAAA
TACATACACA GAAGACATAC ATACGTGTTC CCAAACCAGA CGCTCGGCCT CCGCCGTCAG
TTGTTTCCGT CGCCGCGATG TCCGGTTGTT GCGTAGTCGC ACTCGAGGTG GACGCTCGCG
AGTGTCTCTT CGGCGGGTCC CGCCCGCGTT GTCTTTCGCC TCCTTTCCAG GCGAAAGACT
GTTTTCGAAA AAGTTGAAAG TTGAAATTGA CTGTTTGCGA AAAAGTTGAA AGTTGAAATT
GACAAAAAAG ACTGTTTTCC GAGCGTCAGC TCCTATACCG TCGCACCGTA CGACGGTAGC
ACGGACGTAC TTTTGATTGA CGCCCGCTCG TCGCGCACCG CATCTTGAGG CGTCGAAACC
GCCGAGAGTC GCCGCGCGAT GGCGAAGAGA GCGCATGCAG AGAGCGCGCC CGCGACGCCG
ACGGTATCGG CGCGCGCGAG TGACCATCCG AGCGTCGCGC CACGCGTTTT GTAAACGCGT
GCGGGCCAAG GTGTCGGTGT TACGGCACTT AGTGTTACGG ACGACATGTA TCATATCGAT
ATCAGCTGTT CAGGCGAGGT T
 
Protein sequence
MGGRRCFCSR SSPVGVGAPA PARRRRRCAR AFGARVLFVL MLSGCGRASG ASAARGRAAA 
ATAETTTTTT PRVRASRVDV SHQRSDLGDF ISDLRDNINR VENLFNAGKN AFNQVGAAFD
DLSRDIQSFP GKVNGVKNEV GKLVGTIGDK VEGFLEEVYG KLFPTDAGSM VNDIAGKFSP
TASLGEGRDG PYFEDTSAIG GLVRRLNQDV LRDDLRRVLY NDYASQLGAA HKRAAELGCS
TLPANTTDGE AAKLGCTSIK SLSQVCYDAP MRDLFPESAF SQEYEMPWPK KLGSSPFSPA
KFDVGFPVAS WETCAGINKF NIPTKVATDL INAFRLFFGA LFDGIQEGAT EAATEAKNAI
MVPVNAIDAQ IAVINNHLKT AQDHMNSISR VFGGRRLLSE EDIAEHHHEL RRHTEALAAS
IAHADFVYKS QMQRIEDQVL LGLEQIREVL ILDPLLQDPK KMANFPDYGA LLERRARKMG
VEHDTAELGD AVEKLRSARR ALTSAMAELK NTDLDIGVSS DLELTLEATL KGDAFVQGDF
FESIMEKTKK PIENPTVIRK QVMGAYGFYV NLAFGVGFKL PYFAKADAEA KLRYGLSIPD
MKIGIKSEKG QFSVYFNPPQ PHLDEDGLEA SVSAHLQVGA ELEIPMVQIE LCWAGAICSG
PEVYFSQGAQ VGLDMFAAAM NTPPPCFDGE TTLATYFTDF DYPSKPATCS LTGSGMAAGV
GAYYQVPKPD ALVKLVTTIS SPCEVDVPDV VLYKSSKEDG FFAQGEIFPP QCGADIEAGS
EPPPDKCG
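
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. A minimal sketch of how that could be done, using only the Python standard library; the helper names `join_blocks` and `gc_fraction` are hypothetical, and the GC calculation is a plain G+C count over the sequence length, which may not match how the record's own GC content figure was derived:

```python
def join_blocks(text: str) -> str:
    """Remove all spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C characters in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# The last two rows of the protein sequence, copied from the record.
tail = """GAYYQVPKPD ALVKLVTTIS SPCEVDVPDV VLYKSSKEDG FFAQGEIFPP QCGADIEAGS
EPPPDKCG"""
print(len(join_blocks(tail)))  # 68 (six blocks of ten plus eight residues)
```

Applied to the full gene or protein text, `len(join_blocks(...))` can be compared with the Gene Length and Protein Length fields.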