Gene OSTLU_18496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18496 
Symbol 
ID5005884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp151305 
End bp153080 
Gene Length1776 bp 
Protein Length591 aa 
Translation table 
GC content55% 
IMG OID640421305 
Productpredicted protein 
Protein accessionXP_001421987 
Protein GI145355476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.929094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCGC GGGATGGTTT GGTTTTCGTG TCCTATGTCT CGGACGGCTT TCACGAGTTC 
GCGTTGAATT GGCTGAAGTT GTTGCGCAAG GCCAAGGGAG CGCAGCCGAA CGAAAAGGAT
GAGAACATCG TCATGCTCGC TCTGGACGAA GCCACGGAGC GATTTTGCGA GCGGCACTCC
ATGCCATGTT TTGGGGGCGC GAATTATAGG TACAAAGGAG GGGTCATGGC CACCGGAGGC
ACGGCGCTCG GCGACGCTTC CGGCGCGCGA CAGGCGGCGA GCGTGGCAGA AGCCGCCAAG
GCGATGCGCG AAATGACGAC GCTGCGGGTG AAGCTCTTGC TCGATCTTCT CGACCGTGGA
CACGATGTAT TAGTGAGTGA TGCTGACGTG GCTTGGTTGA GAGATCCCAG GGAATGGATG
CGCGAGGCGA TGACAGACGT CGACGTCGCC GCGAGCACGG ATTGTCTCAA CGCTCGCGAC
GACGACGAAG GAAAGTGTTG GGGAGCGCCG ACAAACACGG GGATTCTTTA CTTTAACGCC
ACGGAGCCGG CGAAAAAGTT CATAGCTGAT TGGGTCGATG GGATGGAAAA AGCCACGGAA
GATACCACGG AGAGAGACCA AGAGATTTTC AATAAACTGC TCATCAAACG ATCCTCCACA
TCGGAATCGC GCGAGATCAA AAGGCGCGTG CGCGTGAAAC GACTCGAGGG AGGCGTTCAG
TTTGCGCTGT TGCCAATGCG TCTCTTCGCT TCGGGACACA CGTATTTCGT TCAACGATTA
CACGAGCGCG AAACAAGACT GGATGAACAG CCGCTGTGCG CGCACGCAAC TTTTCAATTT
TCGCAAGTAC ACGGCAAACG CCAACGTTTC CGCGAGCACG GTTTGTGGGA TGTCGAAGAG
GACGATTATT ACACACAAGG CAACTTTATC GCCATGAGCG ATGAATTACC GAGCGTCTGG
AACGCGACTG GCGTGCATAA TCATCTCCTC GCCGCCGCGT GGTATCGCGC CTCGATTAGA
AACCTTCTAG CGCTCGGTCG AGTGTTGAAT CGTACGGTGA TTTTGCCACG CATCACGTGC
ATGTGCGATA GGTATTGGGG CCATGCGTTA CCTTCGTGCG CGATAGGTTA CCTGCATCCA
CCGTTCGTAG GATGTCCGCA GGATCACATC ATGAACCTGC CGGCGATGGA GAAGGGTGGG
GCAAACTTTA GAGAGTGGTC GTTTCTCGAC AACGCGCGAA CGTCAGACGC CATAAGAAAC
AGCGTCGCGG AGGTGTCGAC AGTAGACAAA GATTCAGAAG CAAAATACGC ACTCGGTCCG
TTTGCGACCG ACACGCAGGT TCTGAAATCT CTAGGCTCTG CCCGAGAGCG AGTGCTCGTC
GTCGATGGTG CGTTGACGTC GTTTTGTGCG TTTGACAGCG AAGCATCTTC TCCCGCGGCG
AAAAGCTTTG ATTCGGATAT GAACATCGCG CTAAAAGCGG AATCTTGGTT TTGTGGTCCC
GAGGACGCAA AAGGGGTGAA GTGCGCCATC GGATTCCCAG TTCCCAAGCC CACAAGCGAA
CTCCGCGGCG AATGCGCCCG TCTTCGGTCT GTTGCGAAAA CGTTCAGCTC GCAACCTCAT
TCGCTCCTGT CCAACTTCTT CGTGAACTAC GGCCAAGACG ATAAAAGTAA CGTATTCGGG
ATCCCTAAAG ATAGTAGTAG CGCACGGCAG AACGCATCTG CGTTCGATAT CGACTTAGAC
GACGTAGACA CACACCCGTT CGCTGTTGAC GAATAA
 
Protein sequence
MLARDGLVFV SYVSDGFHEF ALNWLKLLRK AKGAQPNEKD ENIVMLALDE ATERFCERHS 
MPCFGGANYR YKGGVMATGG TALGDASGAR QAASVAEAAK AMREMTTLRV KLLLDLLDRG
HDVLVSDADV AWLRDPREWM REAMTDVDVA ASTDCLNARD DDEGKCWGAP TNTGILYFNA
TEPAKKFIAD WVDGMEKATE DTTERDQEIF NKLLIKRSST SESREIKRRV RVKRLEGGVQ
FALLPMRLFA SGHTYFVQRL HERETRLDEQ PLCAHATFQF SQVHGKRQRF REHGLWDVEE
DDYYTQGNFI AMSDELPSVW NATGVHNHLL AAAWYRASIR NLLALGRVLN RTVILPRITC
MCDRYWGHAL PSCAIGYLHP PFVGCPQDHI MNLPAMEKGG ANFREWSFLD NARTSDAIRN
SVAEVSTVDK DSEAKYALGP FATDTQVLKS LGSARERVLV VDGALTSFCA FDSEASSPAA
KSFDSDMNIA LKAESWFCGP EDAKGVKCAI GFPVPKPTSE LRGECARLRS VAKTFSSQPH
SLLSNFFVNY GQDDKSNVFG IPKDSSSARQ NASAFDIDLD DVDTHPFAVD E