Gene OSTLU_37419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37419 
SymbolCTPA 
ID5001647 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp874359 
End bp875777 
Gene Length1419 bp 
Protein Length446 aa 
Translation table 
GC content60% 
IMG OID640417068 
ProductD1 proceesing peptidase 
Protein accessionXP_001417628 
Protein GI145346296 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.299887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGCCGA CGGCGCGCGC GCGCGCGGCG GCGGCGGCGG CGGCGGCGGC GGCGGCGGCG 
GTGGCGGCGG CGACGACGTT CGGGACGCCC GCGGCGTTCG CGGATGACGT CGCGCGCGGT
CGAGGGGACG CGCGGACGAA ATCTGTGTCG TCGGCGGTGG AACTGGTGAG CGAGATCGCG
GCCGAAGCGG CGGAGGCGGA GGCGGAATCG GAGGGCGCGA CGGACGCGAC CATCTTGGAC
GAAGCGTGGG GGTTGGTTTT CGACAACTTT TTACCGGCGA GAAAATCTGA GTCGGACGGA
TTCGATCGCG CGGCGTGGGA GGCGATCAAG GCCGAACACG AGGCGAATCC GCCTCAAAGT
CGCGAGGAGG CGTACGAGAT GATTAAGTCG ATGCTGGGGA CGCTCGGGGA TAAGTTTACG
CGCTTCATCG AGCCGGATCG GTTCACTTCG ATGTTGAAAT ACGACATCAC CGGCGTCGGT
TTGAACATCG CGGAAGATGC GGACGACCCT GAACGCGTGC GCGTGCTGGG AATGGTGCTC
GACTCGAGCG CGATGAAGGC TGGAGTGGCG CAGGATGATG AAATCGTCGC CGTCAACGGC
GAACTCGTGC GCGGCTTGAG CGCGTTTCAG GTGTCTTCGC TCATTCAAGA GGCTGACGGG
AAGAGCGTGG ATCTAACAAT CTCGCGCACA GGCGAAGACG TCCCGCGCGT CGTTTCTCTG
ACGCGAGACA GTCAATTCGA AGCGCCGAAA AGTCCAGTGA GCATGCGTCT GGAGGGCGGA
CACGTCGGTT ACATTCGGCT TCGCGAGTTC AACTCGCTCG CCGAGCGCGA TATCGCGAGA
GCGATCACGG ATTTAAGGAC GCAAGGAGCA GACGCGTATA TTCTAGACTT ACGCGACAAT
CCTGGGGGAT TAGTGCAAGC TGGTGTGGAG ATTGCTCGAT TATTTTTACC TGCGGATTCG
ACCATCGCGT ACACCGAAGG TCGAGTCGTC GCCGGAGGCG TCAAACGCGA TACCGACGTC
TCGGCGACAA AAACCGCGAG AAACGGATCT GATTCTCAAC TACCGACTAA GCTGAAGGCG
ATCACGACGT CGAAAAATGA CCCTGTCGTC GCCGCTGACG TTCCGCTGGT TGTTCTTGTC
AACGGCAGAA GCGCTTCTGC GAGCGAAATT TTAACCGGCG CTTTGAAGGA CAACTGTCGA
GCGACTGTGG TCGGGAGTAA GACGTACGGC AAGGGTTTGA TTCAGAGCGT GTACGAACTC
AGTGATTTGA GTGGGATGGT ACTCACCGTG GGTAAGTACG TCACCCCAGG TCTCGTCGAC
ATCGATCAGA CAGGGATTTC GCCAAACTTT ATGATGTTCC CGGGCTTTGA CGCCGCGGCG
AGAGAAATCG ACGCGTGCAA AGTGCCACCA AAATATTGA
 
Protein sequence
MGPTARARAA AAAAAAAAAA VAAATTFGTP AAFADDVARG RGDARTKSVS SAVELVSEIA 
AEAAEAEAES EGATDATILD EAWGLVFDNF LPARKSESDG FDRAAWEAIK AEHEANPPQS
REEAYEMIKS MLGTLGDKFT RFIEPDRFTS MLKYDITGVG LNIAEDADDP ERVRVLGMVL
DSSAMKAGVA QDDEIVAVNG ELVRGLSAFQ VSSLIQEADG KSVDLTISRT GEDVPRVVSL
TRDSQFEAPK SPVSMRLEGG HVGYIRLREF NSLAERDIAR AITDLRTQGA DAYILDLRDN
PGGLVQAGVE IARLFLPADS TIAYTEGRVV AGGAITTSKN DPVVAADVPL VVLVNGRSAS
ASEILTGALK DNCRATVVGS KTYGKGLIQS VYELSDLSGM VLTVGKYVTP GLVDIDQTGI
SPNFMMFPGF DAAAREIDAC KVPPKY