Gene OSTLU_14688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_14688 
Symbol 
ID5000923 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp609224 
End bp611755 
Gene Length2532 bp 
Protein Length694 aa 
Translation table 
GC content64% 
IMG OID640416344 
Productpredicted protein 
Protein accessionXP_001416997 
Protein GI145344971 
COG category[R] General function prediction only 
COG ID[COG4099] Predicted peptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.228667 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.32503 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCTC GCGCGTCGGC GCGTCCGCGC CTCGCCCGCG TCGCGCGCGG CGTCGTCATC 
GCATCGCTCT GCGCCGTCGC GCGCGCGGCG AATTTGACGC AGATCGCGTC TCCGTTCGCC
GACGTCGACG CGCGAACCGT CGAATTCATC ACCCCCGCGG CGCACGCGTC GTCCGCGCCG
TCGGCGCTGT CGACGATCGC GTACGTCCTC GGCGCGCGAA CGATCAGCGC GCTGACGCTC
ACGACGTCGC CGGTGACGCT GACGACGATC GCGACGCTCG ACGAGGACGC GACGCCGTGC
GAAGGGCTGT TCGCGTACGA GAGCGCGCGG GACGGCGTCG CGGCGGGAAA TCGAGCGCGC
CTGGCGGTGA CGTGCGCGGG AACGAATTCG TTGTTGATAT ACGGCGTCGA TGGCGATGGA
ACGCGAGCGA CGAAAGCGAG ACAGATACAC GCGGTGAAGA ACGCGACGAG AATCGCGGAG
CCGACGGATG TGGTGGTGGT GAACGGGACG CTGGCGTTCG TGGCGAGCCG AGCGCACGCG
CGGGTGGTCT TCGTGAAGTT GGATCCGAAC GAAAACGACG CGCCGACGAT CACGGGATCG
ACGTATCAGC TCTCGGGAAT CGATAAAATA CAGTTAGGAC CGTATGATAG GCAAACGCTG
CGGACGATGA GCGGGATGAC GCGGAGAGTG ACGACGTTAA AGTACACCTC GGCGGGGACG
ATGCAACTCG TCGGGAGCGT CAAGGATAGT CGGTTGGAGA GCGCGAGCGG CGCGTGCGTG
GGGGTGGGGG GCGGCGAACG CGCGTACACG TACGTGGTGA CGCCGACGTT GAATGGGGGG
ACGTTTTCGA TATTTAACAG CTCGACGTAC GGCGCGCCCG AGTATTTGAA CGGGGTGCAC
GCGGGGCGAC TCGCGCCGAA CGAACCGTCG GATGGCTTTG AGTTAGCGCC GGCGGCGAAC
CGAGATTTAT CGGGCGCGCG CGACGTTAGC GTCCACGGCA CCACCGCGTA CGTCGCCGCC
AAAAGTATCG GTGCCATCGT CGTCGTCGAC ATCACCGACC CGAACGTACC AGTCGTGTTG
GAAAAGGCTC GATCCTCCGC CCTCGCCGGC GTCGACCGCG TCGTCGCGTC GCCCGACGGC
GTCTTCGTCG TCGCCGCCGT CAACGCCACC GACGATGGCG CCAACGCCAC GATCGTCATC
GCTAGCAAAG ACGACACCGC CATCGCCAAG CACGGTGCGT ACACCGGCGC TAAGAAGCGC
CGACGCCTAT TCTCCTCGTC AAAGTAACTA GCCGACGCGA GCCCGCGCAG CCGTAGCCTT
TGGAGTGCGA AAACCTCTTT TCGCCGAATT CGCCACCGTC ATTCGCCCGC GCGCGACCGA
GCGTCGACCG CGCGCGCGAT GACGAGCGTA CGACCGCAGT GCGCTCTTTC GCGCCTCATC
GTCGACGCCC GCGCGACGAA ACGACGCGCG TCGTCGTCGT CACGCGCGGC GGCGGCGGAC
GGCGCGGCGG ACGCGACGAC GACGTACGTC CCGGGGAACG CGACGCTGCG CGAAGCGCAC
GAGTACGGCG CTGAACACGA ATACTCGAGG TGAGTCGACC GCGCGCGGAA CCACGACGAC
GCGGAAACGA CGCGGAAACG ACGCGAGAGG GGATGAGCGG CTGAATGCGA GACTGACCGG
GACGCCGACG CGGCGCCGGC GCGCAGTTTT GATGCGTTAC AACCGCACAG GGTGCACTTT
CCGAACGCGA ATTACGATTA CGACGTCAAG ACAAAGGTGG CGACACCCAC GCGGTATCGA
GTGAAGTTGC CGAAAGGGTA CGATGGGAAC AGGAAAGAGC CGTACGACGC GTTGATTTGC
GTGCCGGGAG AGGCGGGATT CGGGCCGAGG GAGGGGAAGG TATCTTTGAC GTCGAGCGTG
GCAAAGAAGA AGTGGAACGA CGCGAAGGAT GTAATTTTGG TTGAGTTGGC GTTTAACACG
CCGACGTGGC TGAACGATTC GGCGTCGATG AACCACGAGA GCTATTTGCT CAAGGTGGTG
TTGCCGCACT TCTTGGCGGC GCACAACGTG GGCAAGATGT CGCTTCTTGG CTTTGGTTCA
GGCGCGTACG GGGCGCTGAG TTTGCTCATG CGACGTCCAA CGGCATTTCA CGCCGTTATC
GCCGCTGATG TACCGATCTT AGGTGGGTTT AAGATGATAG AGCGAGCGTG GGGACGCGAA
GATCTCGAGC GCGAGGCGAA CTGGGGGTCG TGGAACGAGG CGTTCCCGCA AGACGACGAC
TGGACCCCGT ACGACGTCAC CACCATCGCA CGCGACGCGT GGGTTCGCGA CCATCTCAAC
GCCACTGAAT TGTCTAGAAT CGCTCTCATC CCCGGGAGCA AAACTGCGAA CGAGCTCGGC
GACTTTGCTC GCCAACTCGA AGCCGCCGGC GTCGCGCACG ACGTCATCGA AGGTTTCGAG
AGCGTCGACA TTCATCACGA GGGCGAGTGG ATCGATGCCG CCCTCGGATG GCTCGCGCCG
AGACTCGCGT AA
 
Protein sequence
MAPRASARPR LARVARGVVI ASLCAVARAA NLTQIASPFA DVDARTVEFI TPAAHASSAP 
SALSTIAYVL GARTISALTL TTSPVTLTTI ATLDEDATPC EGLFAYESAR DGVAAGNRAR
LAVTCAGTNS LLIYGVDGDG TRATKARQIH AVKNATRIAE PTDVVVVNGT LAFVASRAHA
RVVFVKLDPN ENDAPTITGS TYQLSGIDKI QLGPYDRQTL RTMSGMTRRV TTLKYTSAGT
MQLVGSVKDS RLESASGACV GVGGGERAYT YVVTPTLNGG TFSIFNSSTY GAPEYLNGVH
AGRLAPNEPS DGFELAPAAN RDLSGARDVS VHGTTAYVAA KSIGAIVVVD ITDPNVPVVL
EKARSSALAG VDRVVASPDG VFVVAAVNAT DDGANATIVI ASKDDTAIAK HGAYTGAKKR
RRLFSSSKVH FPNANYDYDV KTKVATPTRY RVKLPKGYDG NRKEPYDALI CVPGEAGFGP
REGKVSLTSS VAKKKWNDAK DVILVELAFN TPTWLNDSAS MNHESYLLKV VLPHFLAAHN
VGKMSLLGFG SGAYGALSLL MRRPTAFHAV IAADVPILGG FKMIERAWGR EDLEREANWG
SWNEAFPQDD DWTPYDVTTI ARDAWVRDHL NATELSRIAL IPGSKTANEL GDFARQLEAA
GVAHDVIEGF ESVDIHHEGE WIDAALGWLA PRLA