Gene OSTLU_16531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16531 
Symbol 
ID5003165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp520172 
End bp522238 
Gene Length2067 bp 
Protein Length688 aa 
Translation table 
GC content62% 
IMG OID640418586 
Productpredicted protein 
Protein accessionXP_001419408 
Protein GI145349990 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00789069 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.505528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTCGA TGTTCGACGA CGCGCCGGCG ACGGCGCGCG GCGGCGGCGC GCGACGACGC 
GACGGCGACG ACGCGACGGC GAGCGACGAC GCGACGGCGA GCGACGGCGA AGGCGACGCG
CTGAAAATCA ACGAAAAGTA CGCCGCGCGG TTCGAACACA ACGAACGAAG GAAGGAGACG
CACCGATTGC AGGCGAAGCT GGAGCGGGAG TACGCGCGGG GCGCGCTGCG CGCGCGCGGC
GACGGGAAGA GCGAGAGCGA GACGAGCGGG GAATCGAGCG ATGGGACGTC GAGCGATGAG
GAAGAGACGC TGGAGCGAGC GCTGGACGGG GCGTTCGCGG AGGCGCTGAC GAAGATTCGG
AGGAAGGATC CGAGCATTTA CGACGCGGAG ACGAAGCTGT TCGACGAGGC GAGCGAGGAC
GAGGACGACG AGGACGGGGG GGGGAAGAAG GAGGCGACGT CGACGAAGAG GACGAAGAAG
AAGACGAGGG CGACGTTGCG CGAAGTCGTG GCGACGCAGT TGCTGGAGGG TGGGGCCACC
GCGCTCGAGG AGGCGGAGGC GGAGGCGGAG GCGGCGCGCG CGAACGACGA ACCGTCGTAC
GTCGAGGAGC AAGCGGCGCT GAAGCGCGCG TTCAAGGACG CGGCGGCGGG CGGTGACGAC
GACGAAGACG ATGAAGACGA GAGTGGTCTC GTGGTGAAAC GGCGCGCGGC GGCGATGACG
GCGGCGACGG AAAAGTTGTC TGAATACTTT GACGCGAGAC GCGGCGACGC GAACGATTTG
AGCGCCGAGG ATAAATTCTT GCGAGACTAC TTGCTGGAGA AACAGTGGAT GAAGGAAGAC
TCGAAGAAAG ATTCCGCGGC GGTGCGGTTT CAAACATTAG GGCCGCCATC GAGCGACGAG
GACGACGACG CGGGCGCCGA CGACGAGTCG AGCGATTCCG AACTCCTCGA TCGCGCCGAG
GCGTTCGAGC ACAAGTACAA CTTTCGATTC GAAGAGCCGG GCGCGGATCG GCTCGTGTCG
CACTCGCGTC ACATCGAAGG CTTAGTTCGT CGCGAGGATT CGAAGCGCAA GGACAAGCGA
AAGAAAGTGC GAGAGCGCAA AGAATCTGAG CGAGCCAAGC TCCTCGCCGA GGTGCGTCGT
CTGAAGAATC TCAAGCGCGA AGAAATCGCA AACAAGATGC GTCAAATCTC CGCCGTCGGT
GGGTTGAAGG GCGGCGGCGC GAAGGTGGCG GATTTAACTG AAGAATTCGA CCCCGAAGCG
CACGACCGAG CGATGCGAGA GATGTACGGT GACGAGTATT ACGACGCCGC GGGAGAGGAC
GGCGAAGACG AAGTCTTCGG CGAGCTGGAA AAGCCCGAGT TCGGGGACTT GGAGGAGGAG
ATGAAGGAAC TTTTGAAGGG TGCAGGGAAA CCTGACGATG ACGATTTAGA TGATGACGAT
CATTTCGACG ATGACGACGA ACCGGCGCCG GACGAAGAAG AAGAAGAAAA CAAATTTAGC
AAGCGCGCCG CGAAGAAGTG GCGCAAGGAG CTCGAGGCGA AGATGGACGA GTACTACAAG
CTGAACGCCG AAGATTTCAT CGGTGAAGCG CCGACGCGCT TCCCGTACAA GGAGGTGGCG
CCGAAGATGT TCGGTCTGAC CACGCGCGAC GTGCTTCTGA TGGAAGACAA GCACCTGAAC
CAAATCGTCG GCTTGAAGAA GCTCGCGCCG TATCGCGACG ACGCAAACGA CGCCGCCGTG
GACGCCAACC AACGCGCGAG AGCGCGTCGC ATGGCGAAAG AGTTCTTAGA GAAAGCCAAA
GACAAGAAGA ATCGTTCGTC GCGACGCAGA AAAGACAAAA CCAAGCGCGA GGACGACGAC
GAAGCTTCGG ACGGTTCTGA CGACGACGCC AAGGCGCGCG CGCGGTCGTA CGCCGACTCG
GCGTTCGGCA AAAAGCGTAA ATCCGAAGCG CCGCTTCGAA ACACCACCGC GGATGACGCA
TCCACCGGCG TGGGTAAGAA CGCTCGGAAA AACGCGAAGA AGCGCGCGAA GCGCAAAGCC
AAAGAAATAG CGAGCGCCGC CGTGTAA
 
Protein sequence
MRSMFDDAPA TARGGGARRR DGDDATASDD ATASDGEGDA LKINEKYAAR FEHNERRKET 
HRLQAKLERE YARGALRARG DGKSESETSG ESSDGTSSDE EETLERALDG AFAEALTKIR
RKDPSIYDAE TKLFDEASED EDDEDGGGKK EATSTKRTKK KTRATLREVV ATQLLEGGAT
ALEEAEAEAE AARANDEPSY VEEQAALKRA FKDAAAGGDD DEDDEDESGL VVKRRAAAMT
AATEKLSEYF DARRGDANDL SAEDKFLRDY LLEKQWMKED SKKDSAAVRF QTLGPPSSDE
DDDAGADDES SDSELLDRAE AFEHKYNFRF EEPGADRLVS HSRHIEGLVR REDSKRKDKR
KKVRERKESE RAKLLAEVRR LKNLKREEIA NKMRQISAVG GLKGGGAKVA DLTEEFDPEA
HDRAMREMYG DEYYDAAGED GEDEVFGELE KPEFGDLEEE MKELLKGAGK PDDDDLDDDD
HFDDDDEPAP DEEEEENKFS KRAAKKWRKE LEAKMDEYYK LNAEDFIGEA PTRFPYKEVA
PKMFGLTTRD VLLMEDKHLN QIVGLKKLAP YRDDANDAAV DANQRARARR MAKEFLEKAK
DKKNRSSRRR KDKTKREDDD EASDGSDDDA KARARSYADS AFGKKRKSEA PLRNTTADDA
STGVGKNARK NAKKRAKRKA KEIASAAV