Gene OSTLU_33760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33760 
Symbol 
ID5006382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009372 
Strand
Start bp53214 
End bp54860 
Gene Length1647 bp 
Protein Length548 aa 
Translation table 
GC content43% 
IMG OID640421803 
Productpredicted protein 
Protein accessionXP_001422325 
Protein GI145356204 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.125053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000675839 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGAGGTCA GTATTGTTGA AGCCGGAGAT GCAAATATTT GTGGATCCAA ATGGGTAGAT 
AAGGTAAAAC CAGATGGTTC CCTTAAGTCT AGATTGGTAG TGCAAGGATT CACCCAAGTT
TGGTTGAAGG ACTACCACGA CACATTTAGT GCCGTTGCTT CCATGACCAC ATTTAGGATA
CTCATAAACT TGGCGGCGAT ATTAGGATGG GACATCTTTA CGATAGATGT ATCCCAAGCT
TACACGCAAG GGGAACTCCT AGATGATATA TACGTCAAGG CCCCAAGGTC ACATCCGCTT
CCGAAAGGAA TGGTCTACAA GTTACGCAGA CCCTTGTATG GCACAAAGCA AGCGGGACGG
TGTTGGTATT TGCATGTAAC CAAGACTCTA AGGTCTCTGG GTTTGAACCA ATTATGTAAG
GATAGCTGTT TGTTTGTTAA GATGAGCAAT AGCAAGCCCT TCATGATCAT TTCAGTATTG
GTAGATGATC TTCTGATAAC CGCTGAGAAC GATGAGGTTG TCAAGCAGTT CCACAAAGAA
TTTTCCAGAA TTTACAAAGT TTCGCAATTT GAAAGGATAA AGGTGTACAA CGGCATCCAT
ATAAAGAGAT TAGGGAAGAA TTGTTACACA CTCAACCAAG AATATTCAAT TGCGCAGTTT
CTGGCAAAAT GTCCTGTGCA AGATATCAAT GCTTGCAATT CGCCATTACT ACCATCAGAT
ACGTTTGTTC TCGCCAAGGA GGATGACACA GATGCAGTAG ACAGGATGAA GCGGACAACA
TATCAACAAG TGCTGGGAAG TCTGAATTGG TTTAATACTG CTACTCGACC AGATCTTGCT
GTGGTATGCA GTCTAGCTGG AAGGGTAGCA AGTAACCCAA CGCATAAGCA ATTTAGCGCT
TTATGCAGAG CGGTTGGATA TCTCAAGAGG AACCCCAATA TTCCATTAAC CTACAACGGT
GCTGAATGCA ATGGTATAGT GAGACTTGCA GGATTTACAG ACTCGGATTG GGCAGGCCAA
AAGTTATCCT TAAATTCGAA AGATAGATGC GGAAGAAAAT CCACATCGGG GTATATATCT
TTTTCATGCG GCCCTACCAA TTGGAAGAGC AAACTGCAAG GAATACCAGC CACGAGTTCA
GCGCAAGCAG AGTTCATGGC AATGTATGAA GCCGCGAAAG ATTTATTCTT TCAAATTTTG
TTGTTTCGAG AGCTTGGATT TAAACTGTCA AGAGTACCAC TCTTCTGCGA TAATACGACA
GCCATTCGAC AGGCAATGGA AACTATGTCG TCTAAGTCAA ACAAGCATAT GGAAATAAGA
TACTCCTGGA TCCAACATTA TGCTCACAGA GAAGGGATTA TACAGCCATT TAACATAGGA
TCATCACACA ACTTAGCCGA TATGTTGACA AAGATATTGC CGAACAAGAA GAACTTTTCA
GGTCCAGCAG ACGTTCACGA GTGTTCTAAT CACTTCAACG TGATGTTAAG TCATATATCT
TCGAAGGACA TACGAGACTT CATCAACCAG AGGTTGAGTG AAGGTATGGT GAAGAGTGAC
GCTCTGCACA CATTTCAAGA ATATCTCGAG AAGGTCGAAA GTGAGGAAAT CCAACCTTTC
AAAGGTATTG GTAAGCCCTC AGACTAG
 
Protein sequence
MEVSIVEAGD ANICGSKWVD KVKPDGSLKS RLVVQGFTQV WLKDYHDTFS AVASMTTFRI 
LINLAAILGW DIFTIDVSQA YTQGELLDDI YVKAPRSHPL PKGMVYKLRR PLYGTKQAGR
CWYLHVTKTL RSLGLNQLCK DSCLFVKMSN SKPFMIISVL VDDLLITAEN DEVVKQFHKE
FSRIYKVSQF ERIKVYNGIH IKRLGKNCYT LNQEYSIAQF LAKCPVQDIN ACNSPLLPSD
TFVLAKEDDT DAVDRMKRTT YQQVLGSLNW FNTATRPDLA VVCSLAGRVA SNPTHKQFSA
LCRAVGYLKR NPNIPLTYNG AECNGIVRLA GFTDSDWAGQ KLSLNSKDRC GRKSTSGYIS
FSCGPTNWKS KLQGIPATSS AQAEFMAMYE AAKDLFFQIL LFRELGFKLS RVPLFCDNTT
AIRQAMETMS SKSNKHMEIR YSWIQHYAHR EGIIQPFNIG SSHNLADMLT KILPNKKNFS
GPADVHECSN HFNVMLSHIS SKDIRDFINQ RLSEGMVKSD ALHTFQEYLE KVESEEIQPF
KGIGKPSD