Gene OSTLU_24513 details

Gene Information

Locus tag: OSTLU_24513
Symbol:
ID: 5002044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 148338
End bp: 150816
Gene Length: 2479 bp
Protein Length: 820 aa
Translation table:
GC content: 56%
IMG OID: 640417465
Product: predicted protein
Protein accession: XP_001417930
Protein GI: 145346921
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID:
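
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are made up, and it assumes that the start and end coordinates are inclusive and that the 820 residues are encoded contiguously (the record lists the gene as unspliced).

```python
# Values copied from the Gene Information fields.
start_bp = 148338
end_bp = 150816
protein_length_aa = 820

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one contiguous reading frame plus a single stop codon.
cds_with_stop_bp = (protein_length_aa + 1) * 3

# Bases in the listed gene span that are not accounted for by the CDS.
extra_bp = gene_length_bp - cds_with_stop_bp

print(gene_length_bp, cds_with_stop_bp, extra_bp)  # 2479 2463 16
```

Under these assumptions the inclusive span reproduces the listed Gene Length of 2479 bp, and the listed span is 16 bp longer than a coding region of 820 codons plus one stop codon.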


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.558259
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAGTC TTCGCGTGGC GTCCCTATTC TCCGGAGTCG GAGGCCTCGA TCTCGGTCTC 
CAGCAAGCCG GCCACCGCAT AGAGCTCATG GTAGAGCGCG ACGCCCACTG CAAGCAAGTC
CTCTCGGCGC GCTTTCCGGG CGTCGCGCTG CTCAATGACG TGGCGGAGGT GCTCCCGTTC
ATGCTCGAAA ACATCGACTG CGTCGTCGCT GGGTTTCCGT GCAACGACTG CAGCTGCGAG
AATCTCAAGC GACCCGGACT CGAGCTCGGC GGCGCCACGC GTTCCGTCTC GCACGTGTTT
CGCTTGCTCG AGGCGAGACG GGTGCCGTGG CTGTTGCTCG AGAACGTCGT CGGGTTGTTG
AAGTGGCACA GCGACGGCGA ACAGAGACCG GCGATCGATT ACGTAGTCAA TGAATTGGAA
AATCTCGGAT ACAGATGGGC GTATCGCGTC GTCGATCTTT TGTCCTTTGG GACGCCTCAT
AAGCGACGAC GCGTTTTCGT CGTCGCATCT TTGCACGGTG ATCCCCGAGA CGTGTTATTG
TCGCAGAGCG CGATGTGTTC GGGAGAGTGC GTCCAGCTGG GAATGAACAA CGAGTGTTAC
GAGTGCTTCA TCACTCCACC GCGAGTACCG ACAAAGATGT TTTCGGCGTC GATAGATCTC
GGGGAGAAGC GTCGAGCGCC GTGTTGCGAT ATCATGCACT GCTTCACGAC GAGCAACGGA
CGTCGTACAT GTGTCGCGAC CCAAATCGGA AAGCAAAAGG CTGAGTTGTC AATGTTGGCG
ATAGAAGACG CCGAACGATT GATGGGATTT CCTCCGGGTT ATACGGAACC CTGCTATCCG
CTCATGCGTC CCAACGAACG AGCGCCGGTG TTCGACACGG ATTTACAAAC GATGAAAAGA
TTTAGTCTCT TAGGTCTGGC GTGCAGCGTC CCACAAAGCC GGTGGCTTGG CGAGCAGTTG
AAATGCCCGT ACAACGTGAA ATTTACCTAC GATGCGCTGG CGACGCCATT CGAAAAGCCG
TGTCCGGGAC CAGCGACGCG CGATCGATCT TCAAAGGCAT GGCCGCTCGC CGCTTACAAC
ATGCTCAACG TGAACGGCGA TCCGAAATGG ACAGGTCGAC AACGCGCGCC GAACGAAGTT
TCAGAGTTTC CACTCATTCG CGGTTTCACA CCGCTCGGCG ATTTTCTTGA ATTCACAAAG
AACAAACCCG TGCGGTACGA ACTTCGAGAA GGCTACTTGC GGAGACTCGA GCTCGCGCAC
GAGAACATTG ACTCGACGAT TCTCGCGGCG TTAGACGTCA AGCGTGACTC GACACAAGTG
ACCCTGCCGT CGCCATCAAA GAAGTCTAAG ATTGACATTT TAGAAGACAT TGACGCCGAA
GACGACGAAC AAGCTGCCAA TGAGTACGAC GACGATTCGG ATGAAGACGA GGGGGAGAGC
GAAGAAACAG AGGCTGCAAA GGAAAAGAGA CATCGAGACG AAAACAACGA AAACGACTTA
GACGAACATG GCATGACGAC ACACGGCGAG TGCTCGTGGG TGAAATGGAA AGGAGCTGTG
CGAGGAAAGA CCATGTACTG GCCATGCGTC GCGCTCCATC CGCTTCGGGA TCACGCCGTT
ATCCCTGAGG GCGCACGTGT TGCAGCTTTC AGCCCCAAGT TCACCGAGGA TCATCGTTTA
GTGATCTTTT TCGACGATCG CAGATCCTTC GCATGGGTTA AAGCGTCGGA AGTTTATCCG
TTCGACAAAT TTTACGGCGA AGCGATGAAA CAACCTGTGT TTTCTTCAAA GGCGAAATTT
ACGCAGTCAG TCGAGTCGGC ACGGGCGTGG TGCAATGCGA GGAATTTAGA GGCTCCCGTC
AACCCAATCG TGCAGCGTAA CAACGCACAT CTCTTCACCG ATCCATCGCC ATGTTACGAA
TGTGACGTGT GCGTCACCGA AGCGCACAAG GCTATGGCCG ATAGCAAACC GCAGACGAGA
TCGCGGCGTT CGTCCGGCAC GGAGTCGGAG CTCGCGGCTG GTCGTTCTAA GATGAAATCA
AAGTGCGCAC AGATGAAGAT CATAGAACTC GCGAGACACG GTCAAATCGG CGCCACACTA
GCGCTTCGCA AGGACAAAGC CGTAGGCCAG AGGATAGTGG TGCTGTGGCA GCGGGACAAC
GCGTTTTACT CTGGCACGAT TACCGCCTTC GATCCGCACA CGTATTCGTT TCGCGTCGAT
TACGATGACG GCGACGTCGA CTTGAACTTC AAGCCGTGGA CTGAATCCGT CATGGTAGCG
CAATACGTCC CGTCAAACGT CGACTCGGAT ATCGCTTTGG CAAAGAAGGC CAACGCCGCG
TCCGCGCTCA AGGCGAAAAT CATCGTTCAC ACCGCCGCCG ACGCGGCGCT CGATGAAAAT
CCTCTCTGCA CGAAAACCAT GCGACGCGAT GACGCCGGCG TCGCCATCCA GCTCAAACTC
TAGTTTCCTT CTCGACGAT
 
Protein sequence
MVSLRVASLF SGVGGLDLGL QQAGHRIELM VERDAHCKQV LSARFPGVAL LNDVAEVLPF 
MLENIDCVVA GFPCNDCSCE NLKRPGLELG GATRSVSHVF RLLEARRVPW LLLENVVGLL
KWHSDGEQRP AIDYVVNELE NLGYRWAYRV VDLLSFGTPH KRRRVFVVAS LHGDPRDVLL
SQSAMCSGEC VQLGMNNECY ECFITPPRVP TKMFSASIDL GEKRRAPCCD IMHCFTTSNG
RRTCVATQIG KQKAELSMLA IEDAERLMGF PPGYTEPCYP LMRPNERAPV FDTDLQTMKR
FSLLGLACSV PQSRWLGEQL KCPYNVKFTY DALATPFEKP CPGPATRDRS SKAWPLAAYN
MLNVNGDPKW TGRQRAPNEV SEFPLIRGFT PLGDFLEFTK NKPVRYELRE GYLRRLELAH
ENIDSTILAA LDVKRDSTQV TLPSPSKKSK IDILEDIDAE DDEQAANEYD DDSDEDEGES
EETEAAKEKR HRDENNENDL DEHGMTTHGE CSWVKWKGAV RGKTMYWPCV ALHPLRDHAV
IPEGARVAAF SPKFTEDHRL VIFFDDRRSF AWVKASEVYP FDKFYGEAMK QPVFSSKAKF
TQSVESARAW CNARNLEAPV NPIVQRNNAH LFTDPSPCYE CDVCVTEAHK AMADSKPQTR
SRRSSGTESE LAAGRSKMKS KCAQMKIIEL ARHGQIGATL ALRKDKAVGQ RIVVLWQRDN
AFYSGTITAF DPHTYSFRVD YDDGDVDLNF KPWTESVMVA QYVPSNVDSD IALAKKANAA
SALKAKIIVH TAADAALDEN PLCTKTMRRD DAGVAIQLKL
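
As a consistency check between the two sequences above, the first line of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the database's own method: the Translation table field in this record is blank, so the standard genetic code (NCBI table 1) is assumed, and `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Assumption: standard genetic code (NCBI table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
first_gene_line = ("ATGGTGAGTC TTCGCGTGGC GTCCCTATTC "
                   "TCCGGAGTCG GAGGCCTCGA TCTCGGTCTC")
first_protein_blocks = "MVSLRVASLF SGVGGLDLGL"

matches = translate(first_gene_line) == first_protein_blocks.replace(" ", "")
print(matches)  # True
```

Passing the full gene sequence to the same helper would extend the check to all 820 residues, again only under the standard-code assumption.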