Gene OSTLU_18694 details


Gene Information

Locus tag: OSTLU_18694
Symbol:
ID: 5006283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009371
Strand:
Start bp: 186403
End bp: 188796
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table:
GC content: 68%
IMG OID: 640421704
Product: predicted protein
Protein accession: XP_001422120
Protein GI: 145355763
COG category:
COG ID:
TIGRFAM ID:
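
The length fields in this record can be checked against the coordinates. The sketch below is a minimal illustration in Python, not part of the original record. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count of an unspliced CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(186403, 188796)
print(length, protein_length_aa(length))  # prints: 2394 797
```

Both values agree with the Gene Length and Protein Length fields, which is consistent with the gene being unspliced.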


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0344474
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000276573
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCGCCGT TCGCGCGCGA CGCGGTCGTC GCGCTGGACG CGGAGGCGGC GGCGCAGGCG 
CGCGCGCACG CGGGCTGGGG CGGGACGAGC GGGACGGGAC GCGCGCGCGC GACGCGGGCG
CTCGAGGCGT ACGCGACGAT GGACGACGCG ATGGGCGACG AGGAAGAGGT CTGGGACGTC
AGGGGGGAGC GGGTGCCGGC GGAGGCGCCG CTGGTGATCT GCGTCGCGGG ACGGCTGGGG
GAGGCGGCGG CGAACGTGCG GCGGTGCGTG CGGGCGCGGC GACGCGCGAC GCGCGCGACG
GTGTTCGTCG GATGCGACGA GGGCGATGAG GCGCACGGGG CGTGCGTGGA GGCGCTGACG
GAGGCGTGTG AAAGGATTTT GAGCGACGCG GCGGCGGAAT TCGGGGAGAC GCCGACGGGG
AATCGTTCGG GGGTGGGACG GAACGAAGAA GGAGACGAAG ACGACGAAGA GGACTGGGGG
AGTTGGGGCG ACGAGGACGA GGCGAGAGAC GAGGCGCCGG TGGAGGACTC GAGGGAGGGG
ACGGGCGATG GGTGGAACGA TGCGCCGACG CCGACGTCGA AACAAGGGCG AAATCGCACC
GCCGCCGCGG ACGCCGTCGC GGGTCGGTTT TCGGTCAAGT TTTTCCCTCC TATGATGTAT
CGCGCGCTCG GCGACGGTGC GTTCACGCTT CCGCGAACGC GAACGATGGG GTTGGTGAAC
GATGCCGCGG CTGCGTTGAC GGGGGAGTCG AAAGAGAATC GCGTCATCGG GCATCACTTG
GCTGAAATCG CCGCGCACTG GGCGTTGGCG CCGGATTATT TCGCCCTCGG TCCGAACGCC
GAAGCCGTGT CGCGCGTCGC AGCGCAAGCG AAGACCGATC CCGTGGGCGT GGACACGACG
GTGAAACCAC GAACGGCGGC GATCATCGTC GTCGATCGCG AGGTGGACTT GATGACGCCG
AGCGTGAGTC GAGACGGTTG GCTCGAGCGC GTGTTGGAGA CGACGGACGA CGAAGACGCG
GCGTCTTCAT CGACCTACGT CGACCGCGTC ACGGCGACGT TGTCCCCGCT CTTGAGCGAT
GAAAATGTTC TCACGTTAGA TGAAGCTCTG TGCGCGAAGA CTGCGCGTGA TGGCGCTGTG
CACGTGCGGA AGTTGCTGCG CGAGGCGGCT CGCGTGGAAT CCGTCGCCGC GCCGGCGGCG
GATGGAAAGA GCGCGCGCGT CGTCGGTGCT GACGATATTC TTAGCTTAGT GCGAGCGCTG
GAGGTTGATC CTAGCGTTGC TTTGCGTCAT CGCGCGTTGA TTCAGCGCGC AAAGCTCACC
GCGCGGAGTT TGACGGATGA GAACGACATG AAGGCGAATA GGCAAATCAT CGCTTTGCAA
CGACTCACCG CCGCTGCGCT CGAGCGCCAA GCCACGGGCG TGTGCGCGAC GGTGGTAGAG
ATACTGAAAG TGATGTATTC TGCGGGAGGT ACCGCCGTAG GTCACCCGAG CGAGGCGCTG
GCGCTCGTCT TAGCGGCGTA TGTACTCGCG ACGGAGGCGA ACGTTCAAGC GCAGGCGCCG
ACGAACGCCG CGTCGCCGTT CACGGCGCAA GACGAGGCGT CCGTGCGCGA CGCACTTTTG
GGCGCACTAC TGGCGAGCGA TCTCGCCGAT GTGAAGAAGC AAGTTCCAAG TTTCAACGCC
AGTGCGCTCG AAGCGCTCGA AGCGCTTCAG AACGCGACCG CGGCCGCGGC CGCCGACGCG
ACGCCGACGC CCTCGAAAGA TGACGACGGT TGGGACGACG ACGACGACGA CGACTGGGGC
GACGATGAAT GGGGCAACTC TCCGACGGCG CCGAGTAAAT CCACGACGAC GAACGCGATG
ACGGCGATCG ACGATCCCGA ACTCGCCGCC GCCGCGCTCG AGGCGCGAGA CGCCGTCGAG
CGCGCGCTTC ACAACTTCGC CCTCGCCGCG CATCGCGGCC GCGCATCGCT CAAGCACAAC
ATCCCAGAGT CTAACTCACT CTACGCCAAC GGTCTACCCA ACTCCATCCT GCTCGACATC
ATCTCCCGCG TCAAGACGTC CGCCGACGAC GGCGGCGCGT GCGCCGATTT CGTCCACGTC
GCCGCCTCCC TCGGCGGCTT ACTCAAGCAC GCCGCCGCCA CCGCCACCGC ATCCATCGTC
ACCGGCGCCA TGGGTCGTCT CGGCAACCTC ATCAACAAGG TCACCGCCGC CCCCAAACCA
TCCGATCGCG ACGTCGTCGT CGTCTTCCTC CTCGGCGCCC TCTCCTCCGG CGAATTCGCC
GCCGCCCTCA CCGCGCGCGC CCCCGATCCC GCCGCCGCGC TCTTCGCCCG TCACAGGGAT
CGCGAGTTCA TCTTCGGCGC GCTCGACCTC GCGTCCGCGC GCGCCATCGC GTGA
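
A GC content figure like the one in the Gene Information block can be recomputed from a sequence laid out this way. The following is a minimal Python sketch, assuming GC content is the fraction of G and C bases over all bases and that the spaces and line breaks are only layout. The function name is hypothetical, and how the record rounds its percentage is not stated.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Demonstration on the first row of the gene sequence only; pass the
# full sequence to compare against the record's GC content field.
first_row = "ATGGCGCCGT TCGCGCGCGA CGCGGTCGTC GCGCTGGACG CGGAGGCGGC GGCGCAGGCG"
print(f"{gc_content(first_row):.1%}")
```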
 
Protein sequence
MAPFARDAVV ALDAEAAAQA RAHAGWGGTS GTGRARATRA LEAYATMDDA MGDEEEVWDV 
RGERVPAEAP LVICVAGRLG EAAANVRRCV RARRRATRAT VFVGCDEGDE AHGACVEALT
EACERILSDA AAEFGETPTG NRSGVGRNEE GDEDDEEDWG SWGDEDEARD EAPVEDSREG
TGDGWNDAPT PTSKQGRNRT AAADAVAGRF SVKFFPPMMY RALGDGAFTL PRTRTMGLVN
DAAAALTGES KENRVIGHHL AEIAAHWALA PDYFALGPNA EAVSRVAAQA KTDPVGVDTT
VKPRTAAIIV VDREVDLMTP SVSRDGWLER VLETTDDEDA ASSSTYVDRV TATLSPLLSD
ENVLTLDEAL CAKTARDGAV HVRKLLREAA RVESVAAPAA DGKSARVVGA DDILSLVRAL
EVDPSVALRH RALIQRAKLT ARSLTDENDM KANRQIIALQ RLTAAALERQ ATGVCATVVE
ILKVMYSAGG TAVGHPSEAL ALVLAAYVLA TEANVQAQAP TNAASPFTAQ DEASVRDALL
GALLASDLAD VKKQVPSFNA SALEALEALQ NATAAAAADA TPTPSKDDDG WDDDDDDDWG
DDEWGNSPTA PSKSTTTNAM TAIDDPELAA AALEARDAVE RALHNFALAA HRGRASLKHN
IPESNSLYAN GLPNSILLDI ISRVKTSADD GGACADFVHV AASLGGLLKH AAATATASIV
TGAMGRLGNL INKVTAAPKP SDRDVVVVFL LGALSSGEFA AALTARAPDP AAALFARHRD
REFIFGALDL ASARAIA
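
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The record leaves the Translation table field empty, so the Python sketch below assumes the standard genetic code (NCBI translation table 1). The names used are illustrative and do not come from the source database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGGCGCCGT TCGCGCGCGA CGCGGTCGTC"))  # prints: MAPFARDAVV
```

The output matches the first ten residues of the protein sequence; the same function can be applied to the full gene sequence to check the remaining residues.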