Gene OSTLU_24607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_24607 
Symbol 
ID5002021 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp755884 
End bp759191 
Gene Length3308 bp 
Protein Length999 aa 
Translation table 
GC content62% 
IMG OID640417442 
Productpredicted protein 
Protein accessionXP_001418100 
Protein GI145347277 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.230568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGACGA CGACGGCGAG GGAGACGGGC GACGCGGCGA AGGCGAGCGA AGGGAAGACG 
GCGAAGGCGC TCGCGGCGAA GCCGGCGGCG GAGTCGTCTT CGACAGCCGT GGAGCTCGTC
GCGACGGCGA ATTTAGGCGG TAAGTCCAAG GTTTTGTTTG AGATCGAAGG CGCCGGAGAG
GCGGTGGATT TAGACGGCGA TACCGGCGCC GTGGGTCGTT GGCTCGCCGA GTCCTCGCGC
GCGCTCAAGG TTGACATGAA GGGCGTGATG TACAACGCGC GCGTCGTCTC GAGCGCCGGC
ACCGTCGTCG TCGTCGCCGT CAACGCCGAC GTAGCCAAGA TCGAGAGCGT ACACCGCGAG
TTCGTGCAGC TTCGCGAAGA CCCGAGCGCG ATGGGCGGCA TGGAGAATTT CGGCGTCGGT
TCGCTCTTCG ACCAGGACGA AGACGTCGAC GACGAAGACG TCGCCGTCGG CGCCAAGCGC
AAGCGCGCCG CGGAGACCTC CAAAGCCTCG GGCGCGTCCG CTCGCAGGCC CGCGACGAAG
CGTAAACCGG CGCAGAAGCG CAAGCCAGCC GTCAGGCGCA AGAAGTAGCT CGAGTTGTCA
TTCGATTCCT TCGTCGCCCG CCTCGCGCGC CCCGGCGGTT GACTGCCAAC CTCCCGTCGC
GTCACTCCGC GTCCCGCGAG CGTCCCTCGT CGCGCGCGCG CCCGCGCCCG CGCCCGCGGC
ACGTCGACGC GCGCATCGAG CGTCGTTCGG CGCCGCGTTT GCGGACGCGC CGCGACCGCG
CGCGCGGTTT CGCGTCGCGC CGCCCGCGCC GTGTCGAGTC CGCGAACGCG GCGCGATGCC
CGCCCCCGCG GACGCCAAGC GCGTGTTCAA GCTCGCGCTC GGCGCGGACG GGACGAAGAA
ACAGCGATAC GTCGTCGAGA CGCGCTCGGC GGGCGGCGGA CGGCTCGGGG TGATTCTGAT
CGGACTGAAG AGCCCAGATC GGGTGCTGGT GAGCTCGAGC GCGGCGCGCG CGGCGCTGAA
CGCGGAAGAA TTCGTGTGCG TCGACGCGAA CGTGGAGCGA GAGCTGGTGA ACGCGCGAGC
GATACGCGTG GTGCGAGTGG AACGCGGCGA ACGGGTGTGC GTGCTGGATG ATGGTGAGGC
GACGAACGCG AACGTCGGGG CGCCGGCGAA CCGCGGGGAG ACGTCGACGG AGGGCTCGCC
GAGCGGACCG CCGCCGGCGA AGATGATGGG AGCGACGAAC CAAGACGTGT TGGGGATGTT
GAGCGGGATC ATGAAAGGGG CGAGTCCGCG GATTTTGAGG AAGGATGAAG CGGGAGGGAT
AATACAGAGC GAGCCGCCGG CGCCGACGCG CGCGGCGGGG CAACCGACGG GGCAACCGAC
TCGGACGAAG TCGCCGATGT TGACGCCGCG AACGACGCGC GGGGGCACCG CGACTCGTGG
GCCGACGTCG ACGAACGGGC AGACACCGCC GATGGGGAGT GCGTCCGGCC CTGGTGCGTC
GTCGCTGCCG CTGGGTACGT TTGGCGCCGC GTCGCCGTCG TTCTTTTCGC CAGCCTTCGC
GTCGCCGAGC GAAGCCGCGC CAGCGCGAGC GAGCGGTGAT GATTACATCG AAAGTATGAT
GAGTGATTTT TCCGCGGCGC TTCTCGGAGG TGATACAGGA GACGCGCGCG TGGGCGCGAA
CATCCCTCTC GGATACACTT CACCGTCGGT GAACTCGCAT CAACAACTCG AGCACAACGT
CCCGTCAGCG GGTGACAGAG AATTGAGTCC GTTTTTCTCT TCCGTGGACG CCTCAGCGAG
TGGTGTCCTC GGAGGCGCGC TAGAAGATGG TGTCGACGTG GCGCCGCACG AAGACATTTC
CGGACTTCGT GGCGGCGATG CAGACGAAAT CGACCTCGCC GAACTCACAG CCGGTCGATT
CGAAGCGGAT GCACTGGGTC TGGACGCCAC CCCGGCCGAA CTCACAGATC TGTTGAATCG
ATCGCGCATA CTTCGTCAAG CCACGGAGAC GTTGGGCGAT TCTGCGCTCG ACGACGATTC
CATCAAGGAG GAAGTTTCCG CGCCACCGAC GAACAGCGTG TTCCGCGATC TGCAGCGCGA
GCCGTCGGCT GGTGAAATAT GCACTGCGTT TAATTTGCGC GGGGATAAGT TCGGATGCAA
CGGGTGCGTG AGTCGACACG TGTGTCAACT CTGCGAGTCG CCGCGTCACA CGTTTGGAAT
GTGTCCGTCA CTCTACGCCA ACATTCATAA ACGCGTGGAG AACAAAGTGT GCTTGGATTA
TTACTTGAAC TCCGTGGACG ACACCGGCGG AGCTTTGCGT AACGGATGGG ATCAAGAAAA
GCACGTTTGG AATTTGTGTC CCCGCAGCGA CAAGTGCATG CTCGAGCACG TGTGCGGCGC
GTGCGGAAAG AGCGGCAACG ACGCGCATTT ACCGACGTGT CGCTTACACG CCGTCTTCGC
CGACTCTGCT CCAGTCGATT TGTGTCGAAA GTACTACTTG CACTCTCTCG GAGACTATTT
CAAGATGGAG AGCGACTCGC AAAAGTTTAA AATCGGAAGC AGCGGTGGCT GGGCGTTGTG
CGGCAAGGGC GATCGATGTC ACTCTCGACA CATTTGCGGT CACTGCGGTG AAGAAGGACG
CGGCAAACAC AAGCCAGAGT GTAAACTGCA CTCTGTGTGC GGTGTGATTC CGGACGCGAA
CAAGCAACCA TGCTTTTTAT ACTTCATGAA GTCCATGGTC GGCATTCGCG AGTTTGAAAA
GGCCCTCCTC GGCGGTAGCG CGAAGGGATT TTGCTCGCAG AAGAAAGGCG AGTGCTCTGG
ATACCACGTG TGCGGTTCTT GTGGGAAGGA AGACGTGTCG CCGTACAACG CGGAAGGCCA
CGCCGCGTCG TGTCGCATGC GAACGATTTC CAACGAAATC GATCCGACCC TCAACCCGCG
AACGAAACAA ATGCTCAAAG ATTACGAAGA AGAACTCGCA CGCGTTCGCG CGAGCGCCAA
GTCGAAGGAA ACTACCGCGA CGCAAACCAT CGGTGACGAT GGCTCGAAGG CGGAATCGAA
ATCCGATCAA GTCGACGATG ACGCTTTAAC GCCGATCGAC GTGAAACGAT GTCGACGCCT
CCGATCGGAA ATTGACGCCA TCCTTCGAGG CATCGGCGAC GACATCGATT TGCTCGTCGA
AGAAGCGGTG GAATGCGCGG GTAAGGAAAT GAAAAAGAGC GCGAATCCGA CGCACAAACT
AGAACAAATT AGAGAGAACT TTCGAAAGAT GATGTGATGT AATATGTCGA TGATGACAAA
ATATTGTA
 
Protein sequence
MGTTTARETG DAAKASEGKT AKALAAKPAA ESSSTAVELV ATANLGGKSK VLFEIEGAGE 
AVDLDGDTGA VGRWLAESSR ALKVDMKGVM YNARVVSSAG TVVVVAVNAD VAKIESVHRE
FVQLREDPSA MGGMENFGVG SLFDQDEDVD DEDVAVGAKR KRAAETSKAS GASARRPATK
LRERGAMPAP ADAKRVFKLA LGADGTKKQR YVVETRSAGG GRLGVILIGL KSPDRVLVSS
SAARAALNAE EFVCVDANVE RELVNARAIR VVRVERGERV CVLDDGEATN ANVGAPANRG
ETSTEGSPSG PPPAKMMGAT NQDVLGMLSG IMKGASPRIL RKDEAGGIIQ SEPPAPTRAA
GQPTGQPTRT KSPMLTPRTT RGGTATRGPT STNGQTPPMG SASGPGASSL PLGTFGAASP
SFFSPAFASP SEAAPARASG DDYIESMMSD FSAALLGGDT GDARVGANIP LGYTSPSVNS
HQQLEHNVPS AGDRELSPFF SSVDASASGV LGGALEDGVD VAPHEDISGL RGGDADEIDL
AELTAGRFEA DALGLDATPA ELTDLLNRSR ILRQATETLG DSALDDDSIK EEVSAPPTNS
VFRDLQREPS AGEICTAFNL RGDKFGCNGC VSRHVCQLCE SPRHTFGMCP SLYANIHKRV
ENKVCLDYYL NSVDDTGGAL RNGWDQEKHV WNLCPRSDKC MLEHVCGACG KSGNDAHLPT
CRLHAVFADS APVDLCRKYY LHSLGDYFKM ESDSQKFKIG SSGGWALCGK GDRCHSRHIC
GHCGEEGRGK HKPECKLHSV CGVIPDANKQ PCFLYFMKSM VGIREFEKAL LGGSAKGFCS
QKKGECSGYH VCGSCGKEDV SPYNAEGHAA SCRMRTISNE IDPTLNPRTK QMLKDYEEEL
ARVRASAKSK ETTATQTIGD DGSKAESKSD QVDDDALTPI DVKRCRRLRS EIDAILRGIG
DDIDLLVEEA VECAGKEMKK SANPTHKLEQ IRENFRKMM