Gene OSTLU_40743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_40743 
Symbol 
ID5005746 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp365468 
End bp366952 
Gene Length1485 bp 
Protein Length478 aa 
Translation table 
GC content60% 
IMG OID640421167 
Productpredicted protein 
Protein accessionXP_001421637 
Protein GI145354744 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0562331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCCG CGCCCGCGCC GCCCGTGCGC GTCGGTCACG CGCTGAGTGG GATATTCGTC 
ATCAACCTGC GCGGAGACGT GCTGCTGATG CGCGCGTACC GCGAAGACAT CGAGCGACAC
GTGTTGGATG CATTTCGCAC GCAGATCCTG AATCCGCGCG GTGGCGCGCG ACGCGACGCG
CGATCGACGC CTCGACGCGG ACGCAAAGAC GACGGATTCG CCACGGAAGC GCCGGTGCGA
AGAATCGGGA GCGTGACGTA CATGATGAAA CGGTCGAGAG ACGTGTACGT GGTGGGGATC
GCGCGCGGGC AAGGCGAACG GGGCGGACCG GGCGACGCGA ATTTAATGCT CGGATTCACG
TTCCTGGGAC ACGTCGTGCG GCTGTGCAAT CAATACTTTG GCGCGTGCGA TGAAAACGCG
ATCCGTGGGA ACTTTGTGCT GATGTACGAG CTGCTGGATG AGATTTGCGA CGACGGGTAT
CCGCAGATAA CCGCTGGGGA GACGCTGAAG ACGTACATCA CGCAGAAGGG TTCTAAACTT
GAAGGTGCGA TCGGAAAAGA GGCGATGGAA CGGAGCGCGG CGGAGGACCA ACGCCGGGCG
ATGGAGGCGG CGAAACAGGT GACGAGCGCG GTGCAATGGC GAAGAGAGGG GTTATCGTAT
AAGAAGAATG AAGTGTATTT GGACATCGTG GAGAGCGTGA ATCTGATGAT GAGCGCGGAA
GGCACGGTAT TGCGAGCGAA CGTGCAGGGT TCGATTTACA TGAGGACTTT TCTGAGTGGG
ATGCCAAACC TCAGCGTCGG GCTGAACGAT CGCCTCGGGG AGACGACGCG CGTGACGTCG
CGCGGCGAAG ACGCCGAGAC GAGCGCGGCT CGCGATCGAA GGCTGATCGA CCTGGACGAT
TTACAGTTTC ATCAGTGCGT GCGACTGGAT AAATTTAGCG CGGAAAAAGT GATCGAGTTC
ACCCCGCCCG ATGGCGAGTT CGAGCTCGTC AAGTATCGCG TGTCGGATAA CATCACGCTT
CCGTTCAAGC TCATGCCCGT AGTGAAGGAA CTCGGTAGAA CGCGTCTGGC CGTCACCGTC
AACCTACGCT CGCTCTACGG TCCCACGACC GTGGCGAACG AAATTAAAGT GCGAATCCCC
GTCCCCAAGC TCACCGCGCG GGCGACGATC AACGTGAGCG GGGGCAAGGC CAAGTACGTA
CCCGAGGAGG GCTGTCTTCG CTGGAAAATC AAAAAGTGCG CGGGTCACGA GGAATACCAG
CTCGACGCCG AGGTCTTACT CGCCAACACG CTGGAGGACC ACAAACCTTG GGTGCAACCG
CCGATAAACA TCGCGTTTCA CGTCCCGATG TTCACCGCCT CGGGCTTGCG AGTGCGCTTT
CTCGAAGTCA AGGAGGCGTC CAACTACGAC GTCGTCAGGT GGGTGCGATA CTTGTGCCAG
AGCGGCGGTT CGTCGTCGTC GTCGTACGAG ATTAGATGCG CGTGA
 
Protein sequence
MSSAPAPPVR VGHALSGIFV INLRGDVLLM RAYREDIERH VLDAFRTQIL NPRDDGFATE 
APVRRIGSVT YMMKRSRDVY VVGIARGQGE RGGPGDANLM LGFTFLGHVV RLCNQYFGAC
DENAIRGNFV LMYELLDEIC DDGYPQITAG ETLKTYITQK GSKLEGAIGK EAMERSAAED
QRRAMEAAKQ VTSAVQWRRE GLSYKKNEVY LDIVESVNLM MSAEGTVLRA NVQGSIYMRT
FLSGMPNLSV GLNDRLGETT RVTSRGEDAE TSAARDRRLI DLDDLQFHQC VRLDKFSAEK
VIEFTPPDGE FELVKYRVSD NITLPFKLMP VVKELGRTRL AVTVNLRSLY GPTTVANEIK
VRIPVPKLTA RATINVSGGK AKYVPEEGCL RWKIKKCAGH EEYQLDAEVL LANTLEDHKP
WVQPPINIAF HVPMFTASGL RVRFLEVKEA SNYDVVRWVR YLCQSGGSSS SSYEIRCA