Gene OSTLU_18387 details

Gene Information

Locus tag: OSTLU_18387
Symbol:
ID: 5005744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009369
Strand:
Start bp: 360635
End bp: 362359
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table:
GC content: 68%
IMG OID: 640421165
Product: predicted protein
Protein accession: XP_001421635
Protein GI: 145354740
COG category: [C] Energy production and conversion
COG ID: [COG1251] NAD(P)H-nitrite reductase
TIGRFAM ID:
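
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(360635, 362359)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)
```

With the values from this record, the sketch reproduces the listed gene length of 1725 bp and protein length of 574 aa.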


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0535832
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCGGCG CGTGCCGCGC GCGGCGCGCG AGCGCGAGAG AGACGACGAC GCGCGCGGAA 
TTCTGCGCGC CGTGTCGCGA CGGCGCGAGA GCGACGACGA CGCGCGCGAC GCGCGTCGCG
GACGGGGCGT CGTTCGACGT CGTCGTGTGC GGCGCGGGCG CGGCGGGGAC GTCGTTCGCG
CGGACGTACC TGGAGACGAC GGCGCGGGGT GGCGACGGCG ACGGCGACGG CGACGGCGAC
GGCGACGGCG CGCGCGGCGC GAGGCTGCTG CTGCTCGATC GCGGCGGATG CGCGAAGACG
GTCGAGCGCG TGGAGCGCGT GACGCGGCGG GCGTCGGTGT ACGGCGCGCG GGAGGACGCG
CTGGGGGCGC TGGAGGCGCT GGGGGCGCGG ACGGCGACGG CGAGCGTCGC GGCGTGCGAC
GCCGAGTCGA AGACGTTGAC GCTGGACGAC GGCGCGCGCG TGCGGTACGG CGCGCTGTGC
GTCGCCACGG GGGCGACGCC GCGATGTCCA CTGCCGGAGG CGAGCGATGG AGCGGTCGAC
GCGCACGAGG TGCGAGACGT GGAGAGCGCG GACGCGCTGG CGAGACGGTT GAGCTCGATG
ACGGCGACGG CGAGCGCGGA TTCGGATTCA AAGACGAAAC GGATCGCGAT CGCGGGGAAC
GGAGGGATCG CGCTCGAGCT CGTCGACGCG CTATGTGTGC GAGGTTTGCG CGCGCGAGGG
TTGGAGGCGT GTGAATTGGT GTGGTTGGTG AAACACGGCG AAGTCGGCGA CGCCTTTTTC
GACGTCGACG CCGCGGATTT CTTGCTGCGC GCGCTCGACG CGAGACGGCG AGACGGCGAG
GCGAAAGACG ACGACGGCGC GGACGTGGAC TGGGACTCAC CGACGCCCGA ACGCGGTGCC
GGACGATCGA CGAAGAAGCG CGCGCGAGGG CGCGAATCAG GCGCCGCCGC CGGTCCGGAT
TGGCTCGACA GATTTAGAGC GAAGAGCGCG GCGGACGACG CGCGCGCGCC TCTCAGCCGC
ATGAAGCTTC GCGTGCTGAA AAATGTGTGC ATACGCGAAG CGCGTAAAGA CGCAAACACC
GGCGTCAACG TATTGACGCT GAGCGATGGG ACGACGATCG AGGTGGACGC CGTCGTCGCC
GCCGCGGGCG TCGAGCCGAG ATGCGATTGG CTCGACGAAG TCGCCGCGCC GAGGTCGAAA
TCAGACGGCG GTATACTCGT CGATGCGTGC ATGCGCACCG TAGGACCGTA CGGCGACTCA
ATATTTGCCG TCGGCGACGC GTGCACGATG TCGGCGCGCG CGTCGAACCC AGAGACGCCG
TGGTTTCAAA TGCGACTGTG GAGCCAGGCG GCGCAAACAG GCGCCTTCGC CGCGAAAGTC
GCCGCGGGCG TCTGCGACGC CGACGCCCTC GGATTCAATT TCGAAATCTT TACCCACGTC
ACCAGGTTTT TCGGTCTCAA AGTCATCTTG CTCGGGCTGT ACAACGCCCA GAAGCTCGAC
GACGTCCCGG CGAACGAGGT GACGACGTAC CAGCGCGAGT CGCTCGCCGA CGCGACGTAC
GTGCGCGTGC TCCTCGTCCG CGGTCGCATG ATGGGCGCCG TGCTCGTCGG CGACACCGAT
CTCGAGGAAA CTTTCGAAAA CCTAATCTTA GACGGCGTCG ATCTATCGCG TTTCGGTCCG
AGTTTACTCG ATCCCGAGCT CGACTTGGAG GATTATTTCG ACTGA
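
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content listed in the record. It is illustrative only. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses just the first two blocks of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above.
blocks = "ATGTGCGGCG CGTGCCGCGC"
seq = clean_sequence(blocks)
print(seq, round(gc_fraction(seq), 2))
```

Applying `gc_fraction` to the full cleaned sequence gives the value to compare with the GC content in the record.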
 
Protein sequence
MCGACRARRA SARETTTRAE FCAPCRDGAR ATTTRATRVA DGASFDVVVC GAGAAGTSFA 
RTYLETTARG GDGDGDGDGD GDGARGARLL LLDRGGCAKT VERVERVTRR ASVYGAREDA
LGALEALGAR TATASVAACD AESKTLTLDD GARVRYGALC VATGATPRCP LPEASDGAVD
AHEVRDVESA DALARRLSSM TATASADSDS KTKRIAIAGN GGIALELVDA LCVRGLRARG
LEACELVWLV KHGEVGDAFF DVDAADFLLR ALDARRRDGE AKDDDGADVD WDSPTPERGA
GRSTKKRARG RESGAAAGPD WLDRFRAKSA ADDARAPLSR MKLRVLKNVC IREARKDANT
GVNVLTLSDG TTIEVDAVVA AAGVEPRCDW LDEVAAPRSK SDGGILVDAC MRTVGPYGDS
IFAVGDACTM SARASNPETP WFQMRLWSQA AQTGAFAAKV AAGVCDADAL GFNFEIFTHV
TRFFGLKVIL LGLYNAQKLD DVPANEVTTY QRESLADATY VRVLLVRGRM MGAVLVGDTD
LEETFENLIL DGVDLSRFGP SLLDPELDLE DYFD
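
The protein sequence can be cross-checked against the gene sequence by translating the CDS. The record leaves the translation table field empty, so this sketch assumes the standard genetic code (NCBI translation table 1). The `translate` helper is hypothetical and not part of the record. The example translates the first 60 bases and the last 15 bases of the gene sequence.

```python
# Standard genetic code (assumed), amino acids listed with codon bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence (60 bases) and its final 15 bases.
first_60 = "ATGTGCGGCG CGTGCCGCGC GCGGCGCGCG AGCGCGAGAG AGACGACGAC GCGCGCGGAA"
last_15 = "GATTATTTCG ACTGA"
print(translate(first_60))  # MCGACRARRASARETTTRAE
print(translate(last_15))   # DYFD
```

Both results agree with the start and end of the protein sequence listed above, which is consistent with the standard code being the one used for this gene.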