Gene OSTLU_18543 details


Gene Information

Locus tag: OSTLU_18543
Symbol:
ID: 5005918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 242217
End bp: 243662
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table:
GC content: 66%
IMG OID: 640421339
Product: predicted protein
Protein accession: XP_001422015
Protein GI: 145355532
COG category:
COG ID:
TIGRFAM ID:
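
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length; the record itself does not state either convention. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 242217, End bp 243662.
length = gene_length_bp(242217, 243662)
residues = protein_length_aa(length)
print(length, residues)  # prints "1446 481"
```

Under these assumptions the result matches the reported gene length (1446 bp) and protein length (481 aa).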


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000142314
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACGCGCG CGCGCGACGC GCGCGCGCCG CGGGCGTGGA CGGCGGCGCA CAAGCGCGCG 
CGCGAGCACG ACGAGCACGA CGACTGGCGA CGACGACGGC GACACGTGCT CGCGTTCACG
AGCGCCGGGA AGCCGGCGTG GACGCGGTAC GGGTCGACGC ACGGCGTGAG CGGGCTGTGC
GCGACGCTGC AGGCGCTGGG CGCGGTGAGC GCGAGCGCGA TGGGGGGGGA GCTGGAGGGC
GCGGAGGTCG GGGGGAGGCG GTTGGCGAGA CGGGCGAGCG GCGCGATGAC GTACGCGACG
TCGAGCGATC GAGGCGAGAG CGCGGCGACG CTGGAGAAAC ATCTGGAGTG GGTCGAGCGG
GGGGTGAAGA TGCTGGTGAC GAACGCGGGA TTGGAGACGG CGCTGGCGAA GAATGCGAAA
TTTGACGTCG GGCGGGCGCT GGCGACGGCG ACGCGGGCGG ACGAGGTGTT GGGGCAGTTG
GCGAGGAATT TGAGCTGGGA GACGTGTTAC GTGTTCGATA CGTACACGGC GGTGCGCGTG
CGAGCGGAGG TGCGGGAGAC GGTGGCGCGG GCGATGGTGG AGGCGATGAA GACGGTGACG
AAACCGTTCG CGTGCGTGGC GTTCGGAGAC GACGGTCGCG TGGGGGCGTA CGCGAAACCG
CGAGGACATC GACCGACGAC GATCGCGGCG AGCGATTTGA TCGTGTTGAT GAATTTTCTG
CGCGTCGTTT CGCGCTGCGA AGGCGACGAA GACTCGTTCA CGCGCGTGTG TTTGCCGGAA
TTTAATCCAG ACGGGTTCAT GCGCGCGTAC ACGGCGAGGC TGCGGGCGCG CGACGGCGAC
GCGGCGGGCG ATGAGGGACA ACCGGTGAAG TCGACGCGCG CGTCGTCGGG TTCGATCGGG
ATTTGTATCA TAACCGCATC CGCGGACGCG ATGGAGGAAT GTCGCGCGGC GAGAGACGCC
TTGGAAAAGC GCTTGAACGA CGACGGCGCT TTGGCGCAGA TGGTCGCCGC CGCCGGCGAA
TCGATGTCCA TCGCTAAACT TCCATCCGAA GCCCTCGGTG GGTTTCAGGA CGCGAGTGAG
CCGCTCTTGC ACTTCGTGTA CAACCGCCCC GCGCGTCATC AGCACGTTTC TTCCGCGTTC
AGTCCGTCGT TGAACGGCGC CGACGTCAAG GCGATTACTC GAGCGTACGC GTCGACGTAC
ACTTCGATGA AAGAGACCGA AGGCGTCGTC GACGCGAGCG GACCCGGACC CAAATTCATC
GGCGCCGCGC AGCGCGTTCG ATACGAGCGT CGCGCGCGCT TTTCCGTTTT GGCCTGCGTG
GGCGGAGATT TTGAGATTTA TTTAACTCTC AAACCGTCGA CGACGACGAC GACGGCGGTG
GCGCTGTGCA ATCGCCTGTG CGTCTGGCTC CGCGTGCACG AGCCCGAGTT ATTCGTGGAG
GAGTAG
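
The record reports a GC content of 66% without saying how it is calculated. A minimal sketch, assuming it is the share of G and C bases over the whole gene sequence, is shown below; the helper name `gc_fraction` is hypothetical. It strips the spaces and line breaks used to group the sequence into blocks of ten before counting.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# Small illustration on the last line of the gene sequence above.
print(gc_fraction("GAGTAG"))  # prints 0.5
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives the value to compare against the reported 66%.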
 
Protein sequence
MTRARDARAP RAWTAAHKRA REHDEHDDWR RRRRHVLAFT SAGKPAWTRY GSTHGVSGLC 
ATLQALGAVS ASAMGGELEG AEVGGRRLAR RASGAMTYAT SSDRGESAAT LEKHLEWVER
GVKMLVTNAG LETALAKNAK FDVGRALATA TRADEVLGQL ARNLSWETCY VFDTYTAVRV
RAEVRETVAR AMVEAMKTVT KPFACVAFGD DGRVGAYAKP RGHRPTTIAA SDLIVLMNFL
RVVSRCEGDE DSFTRVCLPE FNPDGFMRAY TARLRARDGD AAGDEGQPVK STRASSGSIG
ICIITASADA MEECRAARDA LEKRLNDDGA LAQMVAAAGE SMSIAKLPSE ALGGFQDASE
PLLHFVYNRP ARHQHVSSAF SPSLNGADVK AITRAYASTY TSMKETEGVV DASGPGPKFI
GAAQRVRYER RARFSVLACV GGDFEIYLTL KPSTTTTTAV ALCNRLCVWL RVHEPELFVE
E