Gene OSTLU_49676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49676 
Symbol 
ID5001729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp460865 
End bp462395 
Gene Length1531 bp 
Protein Length499 aa 
Translation table 
GC content56% 
IMG OID640417150 
Productpredicted protein 
Protein accessionXP_001417778 
Protein GI145346608 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0207] Thymidylate synthase
[COG0262] Dihydrofolate reductase 
TIGRFAM ID[TIGR03284] thymidylate synthase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.21688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATG GAAAGTTTCA AGTCGTCGTC GCGGCGACGC GCGAGGGAGG GATCGGGAGG 
GAGAACGCGC TGCCGTGGCG ACTCGCCGGG GACATGGGAT ACTTTAAAAA GATTACGAGC
GAGACGCGGG ACGGGGACGC GATGAACGCG GTGGTGATGG GGAGAAAGAC GTGGGAATCG
ATTCCGGGGA AATTCAGACC GCTGCCGGGA AGGTTGAACA TCGTGCTGAG TCGAAGCGGG
GGATTGGCGG AGGCGAACGA CGAGAATAAT AACGGCGCGG AGACGCTGCC GGAGGGGGTG
CTGGTGCGTA AGTCCATCGA TGACGCGCTG AGCGCGATTT CGTCGAGCGA AAAGAGGATT
GAGAAGACGT TTGTGATCGG TGGGGCGCAA ATTTACGAAG AGGCGCTTCA AAGCGAAAAG
TGCGAAGCCG TGCACCTCAC CGAGGTGGAG GGCGAATTCG AATGCGACGC CTTCATCCCG
AAGATCGACG CCACCAAGTT CAAGCTCTAT GGACAGTCCA AGCCCATGAT TGAGAAAGGC
ACGAGGTTTC AATTTTTGAC GTACGTCACT GCCGACGCCG AGAGCGGAAA GTTCCGCCCG
AAGGCCGACG AGGTTCTGCC CGCGGGTTGC TCGATCAAGC ACGAAGAATA CCAGTACTTG
GAAATGATTC GCGAAATCAT CGATCAAGGC GCGGTGAAGG GCGATCGCAC CGGCACTGGG
ACGATTTCTA CGTTTGGCAA TCAAATGCGT TTCGATCTTC GCCGATCGTT TCCACTTCTC
ACGACCAAGC GCGTCTTCTG GCGCGGCGTC GCGGAAGAGT TGCTGTGGTT CGTCGCCGGC
GAGACGAACG CGAACAAGTT GGCCGAGAAA AAGATCAACA TCTGGGATGG TAACGGAAGT
CGCGAGTACT TGGATTCTAT TGGTTTGACT GAACGCGAAG TCGGTGATCT CGGTCCGGTG
TACGGATTCC AGTGGAGACA CTTCGGCGCC GAGTACACGA ACATGCACGC CGACTACACT
GGCAAGGGCG TGGACCAACT CGCTGAGGTC ATTCACAAGA TCAAGAACAA CCCGAACGAT
CGTCGCATTT TACTCACGGC GTGGAATCCG GCGGCGTTGA AGGAGATGGC GTTGCCACCG
TGCCACATGT TCTGCCAGTT TTACGTTGCC AACGGCGAGT TGAGCTGCCA AATGTACCAA
CGCTCGTGCG ACATGGGTCT GGGCGTTCCT TTCAATATCG CTTCGTATTC CTTGCTCACG
TGCATGATTG CGCAAGTTTG CGGTTTGAAA CCTGGTGATT TTGTGCACTG CTGCGGAGAC
ACGCACGTAT ACTCGAACCA CGTGGAGCCG CTCGAAAAGC AGCTCGCGTG CGAGCCGCGA
CCGTTTCCGA TTTTGAAAAT CAACCCGGAA AAGAAGGATA TCGACTCCTT CACCTTTGAC
GACTTCGAGA TCGTCGGTTA CGATCCCCAC CCTAAAATTG AGATGAAAAT GGCCGTCTAA
CGGCGCCCCG CGTACGTTGT GTCAGTATCA A
 
Protein sequence
MNDGKFQVVV AATREGGIGR ENALPWRLAG DMGYFKKITS ETRDGDAMNA VVMGRKTWES 
IPGKFRPLPG RLNIVLSRSG GLAEANDENN NGAETLPEGV LVRKSIDDAL SAISSSEKRI
EKTFVIGGAQ IYEEALQSEK CEAVHLTEVE GEFECDAFIP KIDATKFKLY GQSKPMIEKG
TRFQFLTYVT ADAESGKFRP KADEVLPAGC SIKHEEYQYL EMIREIIDQG AVKGDRTGTG
TISTFGNQMR FDLRRSFPLL TTKRVFWRGV AEELLWFVAG ETNANKLAEK KINIWDGNGS
REYLDSIGLT EREVGDLGPV YGFQWRHFGA EYTNMHADYT GKGVDQLAEV IHKIKNNPND
RRILLTAWNP AALKEMALPP CHMFCQFYVA NGELSCQMYQ RSCDMGLGVP FNIASYSLLT
CMIAQVCGLK PGDFVHCCGD THVYSNHVEP LEKQLACEPR PFPILKINPE KKDIDSFTFD
DFEIVGYDPH PKIEMKMAV