Gene OSTLU_26731 details

Gene Information

Locus tag: OSTLU_26731
Symbol:
ID: 5004829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 115418
End bp: 116758
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table:
GC content: 59%
IMG OID: 640420250
Product: predicted protein
Protein accession: XP_001420760
Protein GI: 145352876
COG category:
COG ID:
TIGRFAM ID:
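
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 115418
END_BP = 116758
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1341
print(coding_codons(GENE_LENGTH_BP))   # 446
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.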


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.16188
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCT CACGCGCGCA CGGCGTCGCG GACAAGCAAT GTCCCGTCTG TCGCGAGCTC 
TGCGTCAAGC CCGCAGCGCT CCCATGCGGT CACGTCGCGT GTTTATGGTG CACTCACTGC
AGCATGAGTT CTCTCGGTGA ATCCACCTGC GCCCTTTGTC GCAATGAATT CACCGCGCTT
CCCGCGCTGT CGCGCGCGCT GGAGAATCAT CACATGTGGT GCGACCCGCG CGCGTTCTGT
GAGCGACTGA AAGAGGCTCG AAAGGAGGAC GTGGAGCGGG GACACAAGAG CCCCAAATTT
GAGGTGTGCG TCGTCGCGGC GGCGGCGAAC GACGACGCGC GATCGAAGCT CATCTCGTTC
GACGCGAGCG TCGTCGGCGT GGTTGATGAC GACCGTGATA TGGCAGATGT GGTTTTGAAG
GCGCTAGAGG AGACCGAGTG CGGGGGGATG TTTGAAGAGA TGCGGATGTT TGACGCGAGC
GCGGGCGAAG GCGTCGCCGC TTTTCACGCG TGCTGTTGTC CATCGCGAAC GGAGGTCGTT
TCGTGCGGTG AGCTCGCGAG TCGGCCCGTG GTTTGTCAAC AGTGTGGAAC GCTCTACGAG
CGAGCGCACG CGGACGCGCT GACAGCGAAA GCGGGTGAAG GGAACGAGTG GTGCTTTTGC
GGCGGGACAC GACCTCCGAC TCGCGTGATT TTTTCGCTCA AGGAAGACAT AGAGAGCGCG
TACCCGAAGG ACTTCGTCGA TGCGTCGAGA AAACGGTGCG ACCAAGCACT AGCGGAGTGT
TTAGTGGCGC TCGAGCGCGA GGCTGAGCTA AAGGCGAATG TGGAAGGCGA CGTTGACGAC
GTCGACGCAA AGGATGAAAC CGTCTCCACT TCGACGCACA TTGACGTTGG CGGCGCGCGT
ACTCTGGTCT TTGATCACGA AACTTTCACG CATTTCGGCG TCGGGTGCGA TTTCTGTGGT
GTGTATCCCA TCGTTGGTCC GAGGTATCAA TGCGCCGAGT GCAAGGATTC CGAGTTCATG
GGCTTTGATC TTTGCGCCAA GTGTATGCAA AACGTCTTCG AGCACCCAGA GCGAAAACGC
GACTATCGCT TCGCGCAGAA TCACACCGAC GCGCACGAGA TGGTTCTAGT TCGTCCGCGT
CCAACGATGG TTCACGTGAT GAAAAGCCTC CATCCCGAAC TCAGCGCGAA TCAAATCATA
CAATGGTTGG ACAATCAAGC GACGGCGTCG CGAGAAGCCG AAGCCGAAGC CGCCGAAGAC
GAAGCACAAG CCGCCGAAGC TGACGCATCC GAATCCGACG AACCCGAAGA AGGCGACATG
ACCGAGGACG AGCACCAATA G
 
Protein sequence
MSASRAHGVA DKQCPVCREL CVKPAALPCG HVACLWCTHC SMSSLGESTC ALCRNEFTAL 
PALSRALENH HMWCDPRAFC ERLKEARKED VERGHKSPKF EVCVVAAAAN DDARSKLISF
DASVVGVVDD DRDMADVVLK ALEETECGGM FEEMRMFDAS AGEGVAAFHA CCCPSRTEVV
SCGELASRPV VCQQCGTLYE RAHADALTAK AGEGNEWCFC GGTRPPTRVI FSLKEDIESA
YPKDFVDASR KRCDQALAEC LVALEREAEL KANVEGDVDD VDAKDETVST STHIDVGGAR
TLVFDHETFT HFGVGCDFCG VYPIVGPRYQ CAECKDSEFM GFDLCAKCMQ NVFEHPERKR
DYRFAQNHTD AHEMVLVRPR PTMVHVMKSL HPELSANQII QWLDNQATAS REAEAEAAED
EAQAAEADAS ESDEPEEGDM TEDEHQ
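
The gene and protein sequences above are printed in space-separated blocks of ten, so they need whitespace stripped before any processing. The sketch below shows one way to do that and to translate the cleaned DNA. The record leaves the Translation table field empty, so the standard genetic code is assumed here; `clean` and `translate` are hypothetical helper names, and only the first and last stretches of the gene sequence are used as input.

```python
# Standard genetic code (assumed), laid out in the usual T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence listed above.
FIRST_30 = "ATGTCCGCCT CACGCGCGCA CGGCGTCGCG"
LAST_21 = "ACCGAGGACG AGCACCAATA G"

print(translate(FIRST_30))  # MSASRAHGVA
print(translate(LAST_21))   # TEDEHQ
```

Under the standard-code assumption, these two fragments match the first ten and last six residues of the listed protein sequence.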