Gene OSTLU_51888 details

Gene Information

Locus tag: OSTLU_51888
Symbol:
ID: 5006425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009373
Strand:
Start bp: 22453
End bp: 23799
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table:
GC content: 65%
IMG OID: 640421846
Product: predicted protein
Protein accession: XP_001422415
Protein GI: 145356391
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
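
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 22453
end_bp = 23799
gene_length_bp = 1347
protein_length_aa = 448

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one uninterrupted reading frame whose final
# codon is a stop codon that is not counted in the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1347 bp is 449 codons, of which 448 encode amino acids.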


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value: 0.0830441
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGA CGCACGCGCA CGACGCGCCG ACGGAGGCGC GCGCGACGGC GCCGCCGACG 
ACGTCGGTGA CGGTGCTCGC GAACGGGGCG ACGATCGCGA GCGAGAACAC GCCGGGAGCG
ACGCTGGCGT GCGGGGCGTA CGTGGACTGC GGGAGCGCGC GCGAGGACGC GCCGTGGAAG
CGCGGATTCT CGCACGCGCT GGAGCGCGCG GCGTTCAGGG CGACGAAACA TCGAAGTGGG
TTCAGGGTGA CGCGAGAGTG CGAGACGATC GGGGCGAATC TGAGCGCGAG CGCGAGCAGG
GAACAGTTTT GCTTCGCGGC GGATGCGCTG AAGACGCGCG CGGCGGAGAC GGTGGAATTG
TTGCTCGATT GCGCGCTGAA TCCGGCGTTG GAGAATCACG AGATCGAACG AGTGGTGGAG
AATCTGAAGA CCGAGGTGAA GGAGTTGAAC GAGAACCCGC AGGCGTTGTT GATGGAGGCG
ACGCACGCGA CGGCGTACGC GGGGGGCTTG GGGCACGCCC TCGTGGCGCC GAGCGGGGAT
CTGAGTCACA TCACGGGCGA CGCTCTGAGA GAGTTCGTGC GAGAGAACTT CACCGCTCCG
CGCGTCGTGC TCGCGGCGAG CGGGTGCGAA CACGACGAGC TCGTGCGAAT CGCGGAGCCG
ATGTTGGCGA CGCTTCCGAG CGGCGAGGGT TCGCCCGAGA CGCCGACGAC GTACGTGGGG
GGTGATTTTA GACAAAAGAG CGATTCCCCG ATCACGTCCA TCGTGCTCGG GTTTGAGTTC
AAGGGTGGCT GGCGCGACAC CAAGGCCTCG ACCGCGATGA CGGTGCTGAC GATGTTGCTC
GGCGGCGGCG GGTCGTTTAG CGCCGGGGGG CCGGGGAAAG GCATGTACTC GCGCCTTTAC
ACTCGCGTGT TGAACAGATA TTCTTGGGCG CAAAACTGCA CGGCGTTCCA CAGCATCTTC
AACGACACCG GGATCGTCGG GATCTCCGCC ATGGCGAACA GCGCGCACAC CGGTGACATG
GTGAAGGTGA TGGCGGGCGA GCTTCAAGCC GTCGCCGCGA GCGGGGGCGT GAGCCCGCAA
GAGCTCGAAC GCGCCAAGAA CGCCACGGTG AGCTCGATCT TGATGAACTT GGAGTCCAAG
GCTGTCGTCG CGGAAGACAT CGGGCGACAA ATGCTGACTT ACAAGTACCG CAAGAGTGCG
GCGGACTTCA TCGCCGAAGT GCGCGCGGTG AGCGCGCAAG ACGTGCAAAA AGTCGCGAGC
GACTTGCTCG CGAGCGCGCC CACGGTGGCC ATGACCGGCG AGCTCCACGC CGCGCCGCGT
TACGAAGACA TTAAGGCGAT GTTTTAA
 
Protein sequence
MSETHAHDAP TEARATAPPT TSVTVLANGA TIASENTPGA TLACGAYVDC GSAREDAPWK 
RGFSHALERA AFRATKHRSG FRVTRECETI GANLSASASR EQFCFAADAL KTRAAETVEL
LLDCALNPAL ENHEIERVVE NLKTEVKELN ENPQALLMEA THATAYAGGL GHALVAPSGD
LSHITGDALR EFVRENFTAP RVVLAASGCE HDELVRIAEP MLATLPSGEG SPETPTTYVG
GDFRQKSDSP ITSIVLGFEF KGGWRDTKAS TAMTVLTMLL GGGGSFSAGG PGKGMYSRLY
TRVLNRYSWA QNCTAFHSIF NDTGIVGISA MANSAHTGDM VKVMAGELQA VAASGGVSPQ
ELERAKNATV SSILMNLESK AVVAEDIGRQ MLTYKYRKSA ADFIAEVRAV SAQDVQKVAS
DLLASAPTVA MTGELHAAPR YEDIKAMF
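
The sequences above are printed in blocks of ten characters, so the spaces and line breaks must be removed before any computation. The sketch below shows one way to do that in plain Python, then compute GC content and translate the reading frame. The record leaves the Translation table field empty, so the standard genetic code is used here as an assumption; the helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any database tool.

```python
# Standard genetic code (an assumption: the record does not
# name a translation table). Codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence above.
first_blocks = "ATGAGCGAGA CGCACGCGCA CGACGCGCCG"
last_row = "TACGAAGACA TTAAGGCGAT GTTTTAA"

print(translate(first_blocks))  # MSETHAHDAP
print(translate(last_row))      # YEDIKAMF
```

The two printed fragments match the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein Length fields.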