Gene OSTLU_31402 details


Gene Information

Locus tag: OSTLU_31402
Symbol:
ID: 5001272
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 821693
End bp: 824805
Gene Length: 3113 bp
Protein Length: 900 aa
Translation table:
GC content: 59%
IMG OID: 640416693
Product: predicted protein
Protein accession: XP_001417611
Protein GI: 145346262
COG category:
COG ID:
TIGRFAM ID:
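
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming both coordinates are inclusive and that the gene span contains the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumption)."""
    return 3 * protein_length_aa + 3


length = gene_length(821693, 824805)
coding = coding_nucleotides(900)
print(length)           # 3113
print(coding)           # 2703
print(length - coding)  # 410
```

Under these assumptions, the 410 nucleotides not accounted for by the coding sequence would be non-coding (for example introns or untranslated regions), which is consistent with the gene being flagged as spliced.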


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0818576
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCGTCCCGCG CGACCGCTCG CCGCGCGCCG CGAGCGAGCG AAACGACGCG CGAGTTTTTA 
CCGCGCGCGT CGACCCGCGA GCGCGACGCG ACGCGCGTCG GACGCGACCG CGAACGCGCG
CGAGAGGCGA TCGGAGATTC AAACGCGATG AGCCCGCCGG GGTCGTCGGC GCGACACTCG
CCCGCGGCGT CGGGGGCGTC CGCCGAGCGG TGGTTGCAGA AGGCGTTCGG GGAGGCGAGC
GTGAACGTCG TCGATCGAGC GTCGGCGCGA CGGGAGGAGG TGCGCGGGAC GTCGCGCGCG
CGCGCGCGAA CGGGTCGAGG GCGACGCGGG ATTTCGAAAC GGTTTTTAAA CGGTCGAATT
CAGAGGATTC GGGCGGACGG GCGCGGAGAC GCGAGGTCGG AGACGCGCGC GGGTGATTCG
AGGACGGAGA ACGAGACTGA CGAATGATGC GCGGTGGTTC ACACGCAGGC GCGCGAATCT
TCGTACGGGG GTGCGAGGTA CGAGCGGGAG ACGGAGCGAC GCGCGGCGCC GGGGACGGCG
GCGAAGCCGG CGTATTGGGC GCACGGACAA GCGGTGGATT CGAGTCAAGA CGATGGACGG
GCATCTCCCA AGCGGTTCTT TAACAAGGCG AACGAGCGGT TGTACGAGGC GTATTCGCAG
TTGCACACGA TGGCGCAAGA GTTTGATAAG CCGTTTGATT CGCCGGCGAT TCTCGTCGTC
GGACACCAGA CGGATGGAAA GTCGGCGCTC GTGGAGGCGT TGATGGGGTT CCAGTTCAAC
CACGTCGGTG GTGGAACGAA GACGAGGCGT CCAATCGCGA TCAACATGAA GTACAACCCT
TCCGCCGTGG AGCCGAGGTG CTTTTTGATG AAGGACGACA ATCTTGGGCG CGAGGACGAG
CTCTCGCTTC CCGAGTTGCA GGCGCACATT GAAGGCGAAA ACAGACGATT GGAAAACGAG
AACGGGTTTT GGGCGAAGGA CATCGTCGTG AAGATCGAGT ATAAGTATTG CCCGAACCTC
ACCATCATCG ATACCCCCGG TTTGATCTCC GCCGCGCCGG GCCGAAAGTT TTCTGGTTTG
CAACAGTCTG CGCGCTTGGT GGAGGATTTA GTCAAGCAAA AGATGCACCA GCGAGACTAC
ATCATCTTGT GCCTGGAAGA CAGCTCAGAT TGGTCCAACG CCACCACTCG GCGATTGGTT
CTCGAAGCCG ATCCCGAGCT GCGACGTACC GTCGTGGTGA GCACCAAGTT CGACACGCGA
ATCCCGCAAT TTTCTCACTC GCAAGACGTC GAAATGTTCT TGCACCCGCC GGCGCGTCTA
CTCGAGCCGA CCGTCTTGGG TGGTGGTCCG TTCTTTTCCT CCGTGCCCTC TGGCCGCGTT
GGACTCGCAC GAGATTCGAA GTATCGCTCG AACGATCACT ATCGGGAGGC CGTTCTCGAG
CGCGAGCAAC ACGATATCGC CGAACTCGAG CGCCGCTTGG ATCGTCACCT GTCTTCGCAC
GAGCGCGGCC GCATCGGCGT GAGCCAACTG CGATTTTTCC TCGAGCGATT GTTGCAGCAG
CGATACTTGG AGAACGTTCC CACGATTGTT CCCGTGCTCG AACGCGAGCA TCGCATCGCC
ACGAGCAAGC TCTCTGAAAC CGTGCAAGAG TTGGCGGACT TGAACCAAGA TCACCTCAAG
GAGAAGGGTC GCGCCTTTTA TCAGCACTTT TTGGAAAAGA TTCCGGAGAT TGTCCGAGGT
ACGCTCGCCG CGCCGCCGCG CATCTTTGGC GAGTCTCTCG CGCACGAGCA CATTCGTGGC
GGCGCGTTCG TGAACGCGGA CGGTCGACCT TGCATGCCGC AGCAACCCGT GCCAAACGCC
GATATGCGGC TCTTCGGTGG CGCGCAATAC CATCGAGCCC TCGAAGAGTT CCGCTTAATC
GTCAACGCCG TGGAGTGCCC GCCGGTGAGC CGAGAAGATA TCGTCAACTC GTGCGGTGTT
GATGAAATCC ACAACGGTGT CAATTACACG CGCACTGCGT GCGTCATTGC CATCGCTCGG
GCTCGAGAAA CGTTTGAGCC CTTCGTGCAC CAACTCGGCT TCCGTCTGTC TCACATCGCG
CGACGAACTT TGCCTGTGGC CATGTACCTT TTGCAAAAGG AGGGACGAAT TTTGAACGGG
CACGAAGTCT TCTTGAAGAA GATTGGCGGC ACTTTCGCTC GATTCGTGGA CGACCGAGTC
AAGGAATGCC AAGAAAAGTG CCACGAAGAC CTCAAGTCTA CGACTCAGTT CGTCACTTGG
TCGTTGCACT CTGGAAACAA GGCTGGTCTA CGAAACGTCT TGGCGCCGCA TGAAGGCGAA
CATGACGCGA AAGACGAAAA GCGCCGCTCC GGTGGCGAGC TCGTGATGTC GGATAAGGAG
TTGAAGAACT GCAAGCTCAC GGATTTGGTT GAAAACACGC TCTGGAACAG AACGATGAAG
TCGGTCACGG TGGATATTGT TGACATGTTG GTTCGTCAAA TCTTCTCCGG TATTCGCGCA
CACATCGTTC AGTCGGTCGA GCTCAAGTTC AACTGCTTCT TCCTTATGCC TCTGCTGAAC
GACTTCAATA GTTTTTTGCG CAACGAGATG GAAGACGCGT TTGAGATTAG TCTCGACTCC
GTCTTCGACG TCAAGAGCGT CCGAAGCGCG CTCGAGTCTC GTCGTCACAA GCTTGAGAGC
GAGCTCGAGC AAATGGAGCA CATACAGTCC AAGTTTGCAT CGATCCACTC GCAGCTCGAG
GCATCGAGCG GTAACTCTCC TGCAACGCAA GCCGCCGCGA AGGCAGCCGT TGGTGGGACG
ACGTTCCACA GCGCAGCTGC GCGCGAGATG GCGGCGCAGG ACGAAATCCT TCGAGCGAGC
CACGATTTGA GTGAGGGCAT CGCCGAAGCC AAGGCGCAAT TCGCGCAATA TTCGCCGAAT
CCTATGAGAG GCTCGAAGGC GTACGAGCCA TCTCCCAGGC GCTCGTCGGC TGGCCGAAGC
CGAGACCGCG CGCCACTTTC GCCGATGTTC CACAACTAAT AGGCGTTTTA TTTATCAAGA
ATGACGAGGG GCAAAACGGT TGTAAAAGAT TCGCAGTTCT TGTATGTATT CAA
 
Protein sequence
MSPPGSSARH SPAASGASAE RWLQKAFGEA SVNVVDRASA RREEARESSY GGARYERETE 
RRAAPGTAAK PAYWAHGQAV DSSQDDGRAS PKRFFNKANE RLYEAYSQLH TMAQEFDKPF
DSPAILVVGH QTDGKSALVE ALMGFQFNHV GGGTKTRRPI AINMKYNPSA VEPRCFLMKD
DNLGREDELS LPELQAHIEG ENRRLENENG FWAKDIVVKI EYKYCPNLTI IDTPGLISAA
PGRKFSGLQQ SARLVEDLVK QKMHQRDYII LCLEDSSDWS NATTRRLVLE ADPELRRTVV
VSTKFDTRIP QFSHSQDVEM FLHPPARLLE PTVLGGGPFF SSVPSGRVGL ARDSKYRSND
HYREAVLERE QHDIAELERR LDRHLSSHER GRIGVSQLRF FLERLLQQRY LENVPTIVPV
LEREHRIATS KLSETVQELA DLNQDHLKEK GRAFYQHFLE KIPEIVRGTL AAPPRIFGES
LAHEHIRGGA FVNADGRPCM PQQPVPNADM RLFGGAQYHR ALEEFRLIVN AVECPPVSRE
DIVNSCGVDE IHNGVNYTRT ACVIAIARAR ETFEPFVHQL GFRLSHIARR TLPVAMYLLQ
KEGRILNGHE VFLKKIGGTF ARFVDDRVKE CQEKCHEDLK STTQFVTWSL HSGNKAGLRN
VLAPHEGEHD AKDEKRRSGG ELVMSDKELK NCKLTDLVEN TLWNRTMKSV TVDIVDMLVR
QIFSGIRAHI VQSVELKFNC FFLMPLLNDF NSFLRNEMED AFEISLDSVF DVKSVRSALE
SRRHKLESEL EQMEHIQSKF ASIHSQLEAS SGNSPATQAA AKAAVGGTTF HSAAAREMAA
QDEILRASHD LSEGIAEAKA QFAQYSPNPM RGSKAYEPSP RRSSAGRSRD RAPLSPMFHN
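
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that step; `join_blocks` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed (six blocks of ten bases).
first_line = "CCGTCCCGCG CGACCGCTCG CCGCGCGCCG CGAGCGAGCG AAACGACGCG CGAGTTTTTA"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to the complete gene sequence should give a length and GC percentage comparable to the Gene Length and GC content fields, assuming the database computes GC content over the full gene span.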