Gene OSTLU_28244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28244 
Symbol 
ID5006250 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp64828 
End bp66531 
Gene Length1704 bp 
Protein Length526 aa 
Translation table 
GC content59% 
IMG OID640421671 
Productpredicted protein 
Protein accessionXP_001422089 
Protein GI145355699 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.719833 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GACGCGGCGA GGAGGTCGAG AGATGTCTCG CGCGCTCGTC GCGCTGAGCG CCGCGCTCGC 
GCTGAGCGCG CGACGCGTCG ACGCGAACGA TGTCGTCGAC GACGCCGCGC GGACGCGAGA
CGCGCTGACG TCGCGATTGA CGACGAAAAC GGTGCGACTC GAACGATTCG CGACGGATTT
GGAATCTCTA GCGGCGAATG ATTACGACGA GTACGCCGCG TCGAGCGGGT ATTTCGCGCT
CAATCGCACG ACGAAGGACG CGCACATGTT TTACACGTTT TTTGATGCGC GCTCGGGAGG
CGCGGAGAGC GAGGACGCGA TCCCAATCAT TCTGTGGCTC ACGGGCGGAC CGGGATGTTC
TTCGGAATTG GCAGCGTGCG TGTCGTCGGC GCGCGCGCGC GCACGCGCGA TCTTTCGCGT
CTTGAACGAT ACAGGAGAGG GACGCGACAC TGACTGACCA CCGTCTTCCA CACTAGTCTA
TACGAAAACG GACCGTTCGC GTTCGACGAA GACGACGCGA CGAAATTGAA GCGACGCAAA
TACGCGTGGA ACGACGCCGG AAGGTTGCTT TACGTGGACT CGCCGGTGAA CACGGGATTT
TCGTATTCGA GCTCGCGGCG CGACGCGGCG AAGGACGAGA CGACGGTGGC GAACGATTTG
TTGGAGTTTT TGTACGCCTT CATGTTGAGT AGACCGATGC TCGTGGATGC GCCGGTTTAC
GTCACGGGAG AATCGTACGC GGGGCACTAT GTGCCGGCGT TCGCGAGGGC GATTTTCGAC
GCCAACGCTC GAGACGATGG ACCCGTGAGA ATAAATCTTC AAGGCTTAGC CATCGGAAAT
GGGCTGACGG ATCCCGCTAT TCAGTACGCG GCGTACGCGG ATTATTCGCT CGGGAACGAC
ATCGTGAGCG CGGCGACGGT GAAGCAAACG GCGAAGAAAT TACCGTCGTG CGTGGAGAAA
ATCAAGTCGT GCGCGAGCGG TAAAACGTCG AGCAAGGAAA ATCGCGCCGA ATGCTTGGAC
GCGGTGGATT CGTGCCAAGC CATTCCTGAG GCATTGCTCG AAGATGCCGC TGAACGCAAC
GGTGGGAAGG CAATCAACGT GTACGACATA CGTAAATCGT GCGACGCCGA GCTTTGTTAC
GATTTCAGCG CCGCGGAAGC GTTTTTGAAC CGTAAAGACG TTCAAGAAGC GTTCGGGGTG
AGTAAGAAAT GGGAAATGTG CGACGCGAGC GTGCACCAAG ATATGATGGG GGATTGGATG
CACGACTACG AGACGTTGAT TCCAGACATG ATCGAGGCCG GGATTCGCGT GATGATTTAC
GCCGGCGAGG ATGATTTCAT CTGCAACTGG CTCGGAAATC TTCGCTGGGT GAAGGCGATG
CAATGGAACG GACGCGAAGC GTTCAACGCC GCACGTCCTG AGCCTTTCAT CATCCAGGGT
GCGGGTGACG GCGAAGACGA CGTTGTGGGT GGCGACGTGC GCGAACACGG CGGTTTATCC
TTCGTAAAGA TCAGCGAAGC GGGACACATG GTACCTATGG ATCAGCCACG AAACGCGCTG
ACAATGATTC AGCGCTTTGT GAACAACGAA CCGATCGCGC GCGGTCGAGG TGGTGACGAG
CCGAAACTCT CCGCTGCACC ACGACGTTTC GGCCCCGTCG AAGACGACGT CGTTGGCCGC
CTCGCTGTGG CGACTCAGAA ATGA
 
Protein sequence
MSRALVALSA ALALSARRVD ANDVVDDAAR TRDALTSRLT TKTVRLERFA TDLESLAAND 
YDEYAASSGY FALNRTTKDA HMFYTFFDAR SGGAESEDAI PIILWLTGGP GCSSELAALY
ENGPFAFDED DATKLKRRKY AWNDAGRLLY VDSPVNTGFS YSSSRRDAAK DETTVANDLL
EFLYAFMLSR PMLVDAPVYV TGESYAGHYV PAFARAIFDA NARDDGPVRI NLQGLAIGNG
LTDPAIQYAA YADYSLGNDI VSAATVKQTA KKLPSCVEKI KSCASGKTSS KENRAECLDA
VDSCQAIPEA LLEDAAERNG GKAINVYDIR KSCDAELCYD FSAAEAFLNR KDVQEAFGVS
KKWEMCDASV HQDMMGDWMH DYETLIPDMI EAGIRVMIYA GEDDFICNWL GNLRWVKAMQ
WNGREAFNAA RPEPFIIQGA GDGEDDVVGG DVREHGGLSF VKISEAGHMV PMDQPRNALT
MIQRFVNNEP IARGRGGDEP KLSAAPRRFG PVEDDVVGRL AVATQK