Gene OSTLU_31795 details

Gene Information

Locus tag: OSTLU_31795
Symbol:
ID: 5001778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 544398
End bp: 547376
Gene Length: 2979 bp
Protein Length: 992 aa
Translation table:
GC content: 54%
IMG OID: 640417199
Product: predicted protein
Protein accession: XP_001417805
Protein GI: 145346665
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
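
The coordinates and lengths listed above are consistent with each other, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 544398
END_BP = 547376
GENE_LENGTH_BP = 2979
PROTEIN_LENGTH_AA = 992


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2979
print(expected_protein_length(GENE_LENGTH_BP))  # 992
```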


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.163312
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTACT ACGTGGCGGA GAGCCAGAAG CCGAGGGAAC ACGCGGCGCT GGCGCTGGCG 
GTGGACGCGG GGTCGGTGTT CGAGGGCGAG GGCGAGCGAG GGGCGGCGCA CGTGGTGGAA
CACCTGGCGT TTCGATGCAC GGAATCGTAC GAACACTTTG CGATTGTGAA CTTTTTAGAG
TCGATCGGGG CGGAATTCGG TGCGTGCTCG AACGCGTACA CGAGCATGGA TGAGACGGTG
TACGAGTTGA CGATTCCGAC GCAAAAGGCG GAAGTGTTGG CGACGTCGAT GCATATTTTG
AGCGAATTCG CGAGCGCGGT GCGGATATCG AACGAGGATG TGGCGTGCGA ACGAGGGTCC
GTGATGGAAG AATGGCGTTT AGGACGGGAC GCGCGAGGAC GCGCGGCGGA GGCGTATTGG
AAGACGTTGA TGGAGGGGTC GTTGTACGCC GAACGCTCGC CCATCGGATT GGAGGACTTT
ATACAAAACG CCGACCCGCA GGTTTTGCGA GACTTTTACG CCAAATGGTA CCGACCTGAA
CGCATGGCGG TGATTGCGGT TGGAGATTTT CAAGACCTGG ACGACGTCGT GAGCCTGATC
GAGAGCACGT TTCAAGACTT GAAGCCGAAA GAAGGGCAGC CCGCGGAGAA TCCAGTCATG
GAACGACCAA AAAACTCCGC GATGGAGCAT TCCGAACCGC GCGTCGTGAC GCACGTCGAT
CGCGAGTTGA AGCAGACGGC GGTGACAGTG ACGTTCAAGT ACGCGAGTAT TCCGGTGGAC
ACTCCGCGCG GGTATTATTT GAAGACGGTC GAGGATATTT ACAAGACGGC GCTCGATAAT
CGATTGTATC GCATGATGCG TCAACCAAAG CCCCCATTTT TCAGCGCGGG TGGCATCATC
GAGGACGCGA CGAGGACGAC GACTTTACTC AGCGTGCAGG CGACGTGCGC GGAGAGCCGT
GCGAGCACAG GTTTAGAGGC GTTACTTCGC GAACTCGCGC GTATTCGATT GCACGGAATT
TCGGAGCAAG AGTTGAAAAT CGCCAAGTCG CGCATGCTCG CCGATACAGA GCAATTGTAC
GCAGAACGCG AGCAGACATA TTGTGAGTCT GTTCGCGATG AGCTAGTGTG CCATTTCTTG
CGCGGTGATC TCGTCATCGG AGCTGAAGAC GAGGCGGCTC TTGCCAAGGC GTGCATTGAG
CGCGTGTCAC AAGAAGACGT GTTGGCGTTT GCGCGTCAAT TGAACGTGCG TAACTCGTGC
GTGATTCGCG TACAAGAAGG TAGAAAGCGT ACAAGTGAAG ATGATTTGCG AGAAGCGATC
GAGAATGTCC GCTTGAGGGA AATTGAGGGT GCAATTGACC AAAGCGAAGT GTTTGATATT
CCCGAGGTAT TGATGGACGC GACTTCATTG ACTTCTGGCA CCATCGTCGG CTCGCGAGAG
TTACCGGCGT TGGAGGTGAA TGAGATCACC CTGAATAACG GTATGCGCAT CGCCATTCGC
GTGACTGATT TTCTTGACGA TCAAGTCCTC ATACGTGGTG TCGCACGAGG TGGCCTTTCG
GAGGTAGCGC AGATTGATTA CATCGATGCG ATGTGCTCAA ACATGGTCGC CAGCGAACTT
GGCATCTACG GCCATCGACC GGATGTCTAC GACGGTATCA TAGCGGGTCT AAGATCGGAC
GTGCACGCCA ACGTAACCAT GTATCGCCGT AATATTGAAG GTGAAACATC ACCAGTGGAC
ATCGAAAGCG CGCTGCAGTG CATTCATCTT TTGTTCACGC ACGACGTGAG CACGACGAAT
GATCCGGAAG TTTTAGAGAC ACTGATGCAG ATGCAGGAGG AAAAAATTAG AAATCAAAGT
CGAGACCCCG AAAGTAAATA TAGCGAGGTC GTTCGCTCGC TCGTCTACGG CGAGTCGTAC
CATAGTCAGC GAATTACCGT CAAGTCGTTG CGTGAGATGG ACAGCAAAAA GGCTTGCGCG
TTCTTCGACG CTTGCTTCTT GGATCCGTCT GAATTTACCA TGGTTTTCGT CGGAGCGATC
GATTCGAAGA CACTCGTTCC GCTCATCGAA AAGTATCTCG GCTCGATTCC GCCGGCGTCA
CCCACCAAGG TTCTCAAGGC CTTTGAAGGT ATTAGTCAAC GCAAACGCAG CTTGACACCG
TTCCTGCTGA AGTTCCCGAC GCGCGTCATC TCGCGCACTG TGCGAGCGCA CATGCGGGAA
GGGATGTCTA AGGCGTCGAT TACGTTTCCC GTGCGCATAC AAAATCCGGA CTTTCACAAC
AGTCGCGGAC GTTCGACGCT CTTGGGCGGA AAAGAGTTGA CCGTGGCAAA GTTTAAGACG
GTCATGACGG CGGCAATCAT CGAGAGACGA TTGCTGGCTT TGCTGAGATT TGAATACGGC
GAGATTTACA CCTGCCACGC GGATGCATCG TTCGGCTACC AAGACCCGGA TGTCGCTGGT
GAAATGTACT CGGGCGATAT CATGGTATCA TTCTCGTGCG CTCCGGAGCG AGGCGCTCAC
CTCGCGGCAC ACGCCCGAGA AGTCGTGAGA CATCTTCGCG AACACGGTCC GACGGAGGAA
GACGTGCACG CCGTTCGCGA ATGCGAAATT CGAGACTTTG AAGTCAGTCG ACAAGAGAAC
ACATTTTGGC GCGAGTATAT CACCGAACTC TATAAATCGC GGATGATGCA CAAGAGTATT
CTGAACGGCG ATATCGAAGC GCTATATCGA ATGACTGAAG AAGTGCGAGA GGAAGTTATC
GAGTCCCTCT CCCCGGCGGT GATTCGCGAG CATTTACAAT GCGTCATGAG CATGAATAAT
TCCGTTACCG TCGTTCTCAA GCCGCAGCGA TCGCTCTTGC GACGCATCTT CGTTCCATCG
TTCGAAACCC GCGGAGAGGC GATTTACTCC GCGGTTTACT TATCAGGAAT CGCGCTCACT
GCGAGCGCGA TATATGCGAG ATGCCACAAG AAGGACTGA
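
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage like the GC content field in this record. The helper names are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short sample.
first_line = "ATGGCGTACT ACGTGGCGGA GAGCCAGAAG CCGAGGGAAC ACGCGGCGCT GGCGCTGGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the complete listing to `clean_sequence` should give a 2979-base string whose GC percentage can be compared with the value reported in the record.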
 
Protein sequence
MAYYVAESQK PREHAALALA VDAGSVFEGE GERGAAHVVE HLAFRCTESY EHFAIVNFLE 
SIGAEFGACS NAYTSMDETV YELTIPTQKA EVLATSMHIL SEFASAVRIS NEDVACERGS
VMEEWRLGRD ARGRAAEAYW KTLMEGSLYA ERSPIGLEDF IQNADPQVLR DFYAKWYRPE
RMAVIAVGDF QDLDDVVSLI ESTFQDLKPK EGQPAENPVM ERPKNSAMEH SEPRVVTHVD
RELKQTAVTV TFKYASIPVD TPRGYYLKTV EDIYKTALDN RLYRMMRQPK PPFFSAGGII
EDATRTTTLL SVQATCAESR ASTGLEALLR ELARIRLHGI SEQELKIAKS RMLADTEQLY
AEREQTYCES VRDELVCHFL RGDLVIGAED EAALAKACIE RVSQEDVLAF ARQLNVRNSC
VIRVQEGRKR TSEDDLREAI ENVRLREIEG AIDQSEVFDI PEVLMDATSL TSGTIVGSRE
LPALEVNEIT LNNGMRIAIR VTDFLDDQVL IRGVARGGLS EVAQIDYIDA MCSNMVASEL
GIYGHRPDVY DGIIAGLRSD VHANVTMYRR NIEGETSPVD IESALQCIHL LFTHDVSTTN
DPEVLETLMQ MQEEKIRNQS RDPESKYSEV VRSLVYGESY HSQRITVKSL REMDSKKACA
FFDACFLDPS EFTMVFVGAI DSKTLVPLIE KYLGSIPPAS PTKVLKAFEG ISQRKRSLTP
FLLKFPTRVI SRTVRAHMRE GMSKASITFP VRIQNPDFHN SRGRSTLLGG KELTVAKFKT
VMTAAIIERR LLALLRFEYG EIYTCHADAS FGYQDPDVAG EMYSGDIMVS FSCAPERGAH
LAAHAREVVR HLREHGPTEE DVHAVRECEI RDFEVSRQEN TFWREYITEL YKSRMMHKSI
LNGDIEALYR MTEEVREEVI ESLSPAVIRE HLQCVMSMNN SVTVVLKPQR SLLRRIFVPS
FETRGEAIYS AVYLSGIALT ASAIYARCHK KD
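
The protein sequence can be checked against the gene sequence by translating it codon by codon. The record leaves the translation table field empty, so the sketch below assumes the standard genetic code (NCBI translation table 1); the `translate` helper is illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed, since the record gives no translation table),
# listed in TCAG order for the first, second and third codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCGTACT ACGTGGCGGA GAGCCAGAAG"))  # MAYYVAESQK
print(translate("AAGGACTGA"))                         # KD
```

The two sample outputs match the first ten and last two residues of the protein sequence listed above.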