Gene OSTLU_23932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_23932 
Symbol 
ID4999774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp770595 
End bp773650 
Gene Length3056 bp 
Protein Length1009 aa 
Translation table 
GC content60% 
IMG OID640415195 
Productpredicted protein 
Protein accessionXP_001415928 
Protein GI145341670 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACGGC GACGACCGGC GACGGCGGCG CGGGCGATCG TGGGGGGGGT GCTCGCGCTC 
GCGAGCGTCG CGGTGCCGGC GCGAGGGGCG TTCAGCGTGA TGCCGTCGAC GGGGACGGGG
ACGACGGTGA CGTACGCGGG GACGGCGGCG TTTCGCGAAC GCGTCGTGGA CGCGTGCGGG
ACGTGCGGCG GGGACGGCGC GGCGTGTCAG GGATGCGATG GCGTGACGAA CAGTGGGAAG
GTGTTCGATG CGTGCGGGGC GTGCGGGACG GCGTGCGATA AGAGCGATAC GTCCATAACG
TGTACGTTTA ACGCCACGTG CGAGGACTGC GCGAGCGTCA CGGGCGGCGC CGCGACGACG
GACGCGTGCG GGAATTGTAA AGCGCCGACG GACGCGACGT TTTCGCGAGA GGGGGTCCCG
GATCATCATC TCGGTGGCTG CGTCGGGTGC GATGGAGTCG CGAATAGCGG AGCGGTGATG
GACGTGTGCG GGGTGTGCGG CGGGAACGGG TGCTCGGGAA CGGATCCGTC GCTGTGGAGC
TGGTGCTGCG ATTGCGCCGG CGTGGCTTTC GGGACGAGCG CGCAAGATTT GTGTTGTGAA
TGCATCGACG GCGCATCGTA CTACCCCAAC GGTGTGAAAC CAGCCGAGCA CACGCAGATA
GAAAATTTAT GGACGCAAGG GCAAGAGGCG TTCGCGGCGG CGGCGGCGAC GATGACGACG
TTTCGCGCGC TGGTAGAACC AAGACACAAG GCTGCGGCGC AGACGCTCTA TGAACAGGCA
GAACAGGCGT TCAAAGATGC ATGGGCGCTC GTCGCCACGC AGGCCGCTGT TCCAGGTACG
ACGATGTGCT ATCCAGAGCT CACGGCTTTA CCGCAAGTGA CAAACAATCG CGACGCGTGC
GGCGTTTGCA AGGGGCAGCT TTCGTCTTGT TTAGGTTGTG CGACGAGCGC GACGCCGCTT
CCGGTAGGGC CGTTAGAGGG CTTGTATCCA GATGATTGCG ATGTGTGTGG TGGATCGACT
GAGGTGGATG TGTGTGGGAT TTGTGGAGGG AGTAGCAACG GCTTGGATTG CGTCGGTTGC
GACGGCATCG TCGCGAGCGG TAAAGTGCAA GACGCGTGCT ACGACTCTGA CGACACAAGA
ATGTATAGTA TCGACGCCGT TTCGGGAGCG AAGACGCTGC TCGTACAAGT TGGCGAGGTT
GGTAGCGGGT GTAGCGACCC CGATGCTTTC ATCACCGCGT GTGCAGCGGG GGGCGGCGGA
TGTTGCGGTT GTGACGGCGT GCCGAATAGC GGTAAAACGC TCGACGCATG TGGCTCGTGT
TTAGCGGCGG ATGCCACGGA TAGGAAGACC AACGCGAATG CGTGCGAAGA AATCTTCCTC
GTTAAACTTC CAAGCGGTGT AGTGATCGGT CCATTCACGA AAGAACAGAT CCGAGGAGGA
TCTTTGACGT ACACCGAATC GTCGACGAAC ACGGAGACGA TCTACACCAT CGAGCCGGAT
TCGCAGATTG CTAACGCTGC AGTTATTGAT GTCACTGAAA GCGTCGAATA CGGCTACACT
TCCGTCGAGC AAGAATATTT AGATTCTTGG CAGTCGATTT TTGTGAGCGG ACGAGCGGAT
GTGATCAGCT CCACGAGCGT GCCGAACACG GTGAACACAG TCGTCGTCAC TGGTAGTCGA
GTGACGAATA CGAGCGAATA TCAAGCGTTG CAAGTGAAAC CGGAATTCGT CGGTGTTATT
TATCCGTTTT GCACGGGGGC GACGTATCGG GCTGGTCATA GACTGCACGG CAAGAAAACG
GGGTCGAACT ATTTAGGAAT CATCACCACG AGCGCGAGTA TGCCGGAAGA CTGGACCGAG
GCACAGTCAC GTCTCGATCG TGACTGGAAC CCGTATTCGG AGAGAGTGTT GGACTACGTC
GCAGAATGGG CCGATCAGCG GTGTACGTGC GTCGCCGACT GGAGAACTGA ACCCGAGCCC
GTACCTCCAA CATGTGAGCG GGTTTGGCTG CAAGCGAGCG AGGAAAAGAA AAGTTCAAAG
ATGGAGCGGA GCTGGGCGAG CGGCTCGTGG CGCGTTACGG GTGGTTGGTG GACGCAAGAC
TTTGGGCAGA AAACGACGAC CACGCAAAAT GCTGCTCGCG GACCAAGACG TATAGACTCA
TTTGGGACGC TGCGCACACG ACTCTCGCGT ACGGCGAGCC AAGAAACCGA CGAGCTCGGG
GCGTGCGACT ACAAAGCACT ACGGGTCATC GATCCCGTGC GAAACGCGAG CGGCGCGGTG
TGGTATCCAC AGCAACAGCA AGTCGCGCAA GGATTCAACG TGACCTTCAA GTTCATGATC
ACGCAACCGA CAGTCACATG TGACTACGCC GAGAGCGTCA GCGGTGCGTT CGTGCAGTCT
TTACACACTA AGCTCTATGA AAAGTGTACG ACTTCGGGCG GCGACGGCTT TGCGTTTGTC
ATTCGCGACG ATAGCGCTTC GGCGCCGGGA GCGACAGACA TCGGGTTCGA TGGTCCCGGA
TTGGGATACG GTGGCATCAC CAATTCGATT GCGTTCGAGT TCGACACCGT GTTCACCGCA
GCATATAACG AACCACGTGA GAGTCACGTC GCGATTCACA CTCGCGGTAA ATCCCCAAAC
ACGGCGCATT CGGCGGCGAG CCTCGCGACC GTCGCTCTTG ACGGTTCCGT GCCCACGACG
AGCATCACCG ACGGTCAGAT TCACGAAGTC TTCATCACGT ACAAGCCCAA CATCACCTCT
GAGGAGATGT TTTTCGCCAT CGAGTCCGGT GAAATCACCG GTTTGTCCAC CGCCCTCAGC
GCGCACACCG CCGACTCCCT CGGAGTCGTC TCCGTCTACC TCGACGACAT GTCCTCACCG
CTGATGAGCG TCCCTTTCAA CATCGAGAGC ATCTTGCGCG ACTCCGCCAC GAGCGGCAGC
GCGTGGGTCG GCTTCACCGC CGCCACGGGA GATCTCTGGC AAGCCGTCGA CATCTTAGAG
TGGAACATGA CCTCGGTCTC CGTCTCGTGA CGGCGGCACG GTCAGACGGT TGTAAC
 
Protein sequence
MGRRRPATAA RAIVGGVLAL ASVAVPARGA FSVMPSTGTG TTVTYAGTAA FRERVVDACG 
TCGGDGAACQ GCDGVTNSGK VFDACGACGT ACDKSDTSIT CTFNATCEDC ASVTGGAATT
DACGNCKAPT DATFSREGVP DHHLGGCVGC DGVANSGAVM DVCGVCGGNG CSGTDPSLWS
WCCDCAGVAF GTSAQDLCCE CIDGASYYPN GVKPAEHTQI ENLWTQGQEA FAAAAATMTT
FRALVEPRHK AAAQTLYEQA EQAFKDAWAL VATQAAVPGT TMCYPELTAL PQVTNNRDAC
GVCKGQLSSC LGCATSATPL PVGPLEGLYP DDCDVCGGST EVDVCGICGG SSNGLDCVGC
DGIVASGKVQ DACYDSDDTR MYSIDAVSGA KTLLVQVGEV GSGCSDPDAF ITACAAGGGG
CCGCDGVPNS GKTLDACGSC LAADATDRKT NANACEEIFL VKLPSGVVIG PFTKEQIRGG
SLTYTESSTN TETIYTIEPD SQIANAAVID VTESVEYGYT SVEQEYLDSW QSIFVSGRAD
VISSTSVPNT VNTVVVTGSR VTNTSEYQAL QVKPEFVGVI YPFCTGATYR AGHRLHGKKT
GSNYLGIITT SASMPEDWTE AQSRLDRDWN PYSERVLDYV AEWADQRCTC VADWRTEPEP
VPPTCERVWL QASEEKKSSK MERSWASGSW RVTGGWWTQD FGQKTTTTQN AARGPRRIDS
FGTLRTRLSR TASQETDELG ACDYKALRVI DPVRNASGAV WYPQQQQVAQ GFNVTFKFMI
TQPTVTCDYA ESVSGAFVQS LHTKLYEKCT TSGGDGFAFV IRDDSASAPG ATDIGFDGPG
LGYGGITNSI AFEFDTVFTA AYNEPRESHV AIHTRGKSPN TAHSAASLAT VALDGSVPTT
SITDGQIHEV FITYKPNITS EEMFFAIESG EITGLSTALS AHTADSLGVV SVYLDDMSSP
LMSVPFNIES ILRDSATSGS AWVGFTAATG DLWQAVDILE WNMTSVSVS