Gene OSTLU_34179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34179 
Symbol 
ID5000789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp852779 
End bp854512 
Gene Length1734 bp 
Protein Length577 aa 
Translation table 
GC content56% 
IMG OID640416210 
Productpredicted protein 
Protein accessionXP_001417073 
Protein GI145345125 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.137613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.109742 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCGG TGGAGAAGGC GTGCGAAGGA TTCAAGGCGA CGTACGAGGG GACGCTCGAC 
ACGTCGCTGC TCGCGGCGAT CGAAACGTAC GAGGGCATCG AGCAAACGCT GACGACGGTG
CTGAGCTACA TGTCGTTGAG CGCGGACACG CGGTTGATGG ATGATGCGGT GCAGCAAAGA
AAAGGGGCGC TGATGCAGCA GTACGGACAG GCGAGCGGGA ATTATTTGAC GTGGTTCGGA
CTCGAGTTGG CGGATATGGC GGAGAGCGCG TTGGAGGCGC AGTACGCAAA GGACGCAAAG
TTGAAGAATT ACAAGCCGTA CATCGATGAG GTGCGTCGAT CGGCGCCGTA TAACTTGTCC
AAAGAAGTCG AGCGGGCGTT GACCGTGCGC AGCTCGTTCA CGGGCAAGGG CAAGGTTATT
GAATTCTACG GCAACGAGCT CTCTCGGTTG CAGTTCGACC TCGACGGCAA AAAGGTCAAC
ATGGAAGTTT TGCTGGCGAA GATGACGAGC AGCAAGGACA CGATGGTGCG TCGCCAGTGC
ATGAAGACGC TCAACGATGG ATTGAAGGAT TTTGAGAGAA TCTGCGCGCT TTCGCTCAAT
ATGGCTGCCG GGTCCTGGGC AATCGAAAAC AAGGAGCGCG GGTTTAAGAA TTTGCGTTCG
TCGCGCAACC TGAGCAACAA CTTGCCCGAC GACGTCGTCG ACTCGCTCTT GGCCGCCGTG
GCGACCACAG GTGTGGATTA CTGCAAGAAG TACTACACGT TGAAGAAACA AATCTTGAAG
TCGACGCAAG GACTTGAAAC CTTCACTTGG GCCGATCGCA ACGCTTCCAT CGACATCGGT
TCGGGCGAGG ACAACTACTC GTGGGAACAA GCGGTGGAAC TCGTGCACAA GGGCTACAAC
AAGTTTTCTC CCACTATGGC GGACATGTTC ACGAAGTTCG TCGACGAAAA GCGCATCGAC
GTGCCGGCGC AAGATGGGAA AAGAGGCGGC GCATACTGCT CTTCGGCGTA CGGCTCCGGC
CCGTTCCAGC TTCTCAACTT TACCCACTCC GCGCGTGACG TCGCCACGCT GGCGCACGAG
TCTGGGCACG CAGTCCATTT TGAGTTGAGT TACCCGCAAG GCATCTTGCA ATTCCACCCG
CCGCTCACGC TCGCGGAAAC CGCGAGCATC TTTGGAGAAA TGATTGTGTT CCGTGACTTG
TTGGAGAAGA CTCCGAGCGA TGAAGATCGT TTGGCGATGA TCATGTCCAA GGTGGACGAC
ATCATCAACT CCGTCGTGCG TCAATGCTCT TTTGATAAAT TTGAAGAAAA GGTGCACACC
ATGCGCGCCA AGGGTACGGT GACGCCGGAG GAAATGTCCA AGGCGTGGCG CGAGTGCACG
GAAGAGTATT ACGGCAAGGA GGGTGAAATC TTCGACTCTC TCGAAGACAC TTCTCACCTT
TACGCGTACG TCTCGCACTT CCACAACGTC GCCTTCTACG TCTACGCTTA TGCATTCGGT
GATCTCTTGG TCGGCTCGTT GTACGGCGCG TACATGAAGC AACCGGAGGG CTTTGAAGCG
AAGCTTCTCG ATCTCTTGCG AGCCGGCGGC ACGAAGGACT TCGTCGAGGC GGTCGAACCG
TTTGGTTTGG ACCCGAAATC GGCGACGTTC TGGAGCGACG CGTTGCATGC CCACCTCGGT
GGTCTTATGG CCGAGGCTGA AGAGTTGTCC AAGAAGCTCG GATACTCTTC GTAG
 
Protein sequence
MSAVEKACEG FKATYEGTLD TSLLAAIETY EGIEQTLTTV LSYMSLSADT RLMDDAVQQR 
KGALMQQYGQ ASGNYLTWFG LELADMAESA LEAQYAKDAK LKNYKPYIDE VRRSAPYNLS
KEVERALTVR SSFTGKGKVI EFYGNELSRL QFDLDGKKVN MEVLLAKMTS SKDTMVRRQC
MKTLNDGLKD FERICALSLN MAAGSWAIEN KERGFKNLRS SRNLSNNLPD DVVDSLLAAV
ATTGVDYCKK YYTLKKQILK STQGLETFTW ADRNASIDIG SGEDNYSWEQ AVELVHKGYN
KFSPTMADMF TKFVDEKRID VPAQDGKRGG AYCSSAYGSG PFQLLNFTHS ARDVATLAHE
SGHAVHFELS YPQGILQFHP PLTLAETASI FGEMIVFRDL LEKTPSDEDR LAMIMSKVDD
IINSVVRQCS FDKFEEKVHT MRAKGTVTPE EMSKAWRECT EEYYGKEGEI FDSLEDTSHL
YAYVSHFHNV AFYVYAYAFG DLLVGSLYGA YMKQPEGFEA KLLDLLRAGG TKDFVEAVEP
FGLDPKSATF WSDALHAHLG GLMAEAEELS KKLGYSS