Gene OSTLU_35099 details


Gene Information

Locus tag: OSTLU_35099
Symbol:
ID: 5003803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 414541
End bp: 416496
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table:
GC content: 61%
IMG OID: 640419224
Product: predicted protein
Protein accession: XP_001419590
Protein GI: 145350390
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
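
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon of an unspliced CDS. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 414541
END_BP = 416496
GENE_LENGTH_BP = 1956
PROTEIN_LENGTH_AA = 651


def span_length(start: int, end: int) -> int:
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 416496 - 414541 + 1 gives 1956 bp, and 1956 / 3 - 1 gives 651 aa, which agrees with the listed gene and protein lengths.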


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.235498
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000591243
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACGAACG GCGCGTTCGC CGGGGCGGCG CGCGCGGACG CGTTCAACGC GGCGCCGACG 
GAACAAGTGC AACAACAGCG CGGTGAGGCG GTGTCCGCGT TCGCGGATGC GTCCAAGGAA
GCACCGGCGG TGAACGCCGA TGGGTTGCCG GAAGGGAACA ACTGGCGTTA CAGCGAATTC
ATCAAGGCGG TGATGAGCGG CAAGGTTGAG CGCGTGCGCT TCTCCAAGGA TGGCTCCGCG
TTGCAGTTGA CCGCGGTCAA CGGTGCGCGC GCGACCGTCA TCTTGCCCAA CGATCCGGAA
CTCGTCGACA TCCTGGCCAA GAATGGTGTC GACATCTCCG TCAGCGAAGG CGAGCAACAG
GGCAACGCCG CCTCTTTGGT CGGCAACTTG TTGTTCCCGC TCGTGGCGTT CGGAGGTTTG
TTCTTCTTGT TCCGCCGCGC GCAAGGTGGC GATGGCGGCA TGGGTGGCAT GGGTGGTATG
GGCGGTGGAC CCATGGACTT CGGTAAGTCC AAGTCCAAGT TCCAAGAAGT TCCGGAAACC
GGTGTGACGT TCGCGGACGT CGCCGGCGTC GAAGGCGCCA AGTTGGAATT GCAAGAAGTC
GTCGACTTTT TGAAGAACCC AGACAAGTAC ACCGCGCTCG GTGCCAAGAT CCCGAAGGGT
TGCCTTTTGG TCGGTCCGCC GGGTACCGGT AAGACCCTGA TCGCCAAGGC GGTCGCCGGT
GAAGCCGGTG TGCCGTTCTT CTCTTGCGCC GCGTCCGAGT TCGTCGAACT CTTCGTCGGC
GTTGGCGCGT CTCGCGTTCG CGACTTGTTC GAAAAGGCCA AGGCCAAGGC TCCGTGCATC
ATCTTCATCG ATGAAATCGA CGCCGTCGGT CGCCAACGTG GCTCCGGTAT GGGTGGTGGC
AACGACGAGC GCGAACAGAC CATCAACCAG CTTCTCACCG AGATGGATGG TTTCGAAGGC
AACACGGGCG TCATCGTCCT TGCGGCGACG AACAGACCGG ACGTGCTCGA TAGCGCGCTC
CTTCGCCCGG GACGTTTCGA TCGTCAAGTT ACCGTCGATC GTCCGGACGT CGCTGGTCGC
ATCCGCATCC TCAAGGTGCA CGCCCGTGGC AAGACTTTGG CCAAGGACGT CGACTTCGAC
AAGATCGCTC GCCGTACGCC GGGTTTCACG GGTGCCGATT TGGAAAACCT CATGAACGAG
TCCGCGATTC TCGCCGCGCG CCGTGAACTC ACGGAAATCT CCAAGGAAGA AATCGCCGAT
GCTCTCGAGC GCATCATCGC CGGTGCCGCC AGAGAAGGTG CCGTCATGTC TGAGAAGAAG
AAGAAGCTCG TGGCGTACCA CGAAGCTGGC CACGCGCTCG TCGGGGCCCT CATGCCGGAT
TACGACGCCG TGACGAAGAT TTCCATCGTC CCGCGCGGTA ACGCCGGTGG TTTGACTTTC
TTCGCCCCGA GCGAAGAGCG TCTCGAATCT GGCTTGTACT CTCGCACGTA CCTTGAGAAC
CAAATGGCTG TCGCCATGGG TGGTCGCGTC GCCGAAGAAC TCATCTTCGG CGCTGAAGAC
GTCACCACGG GCGCGTCCGG TGATTTCCAG CAAGTCACCC GCACCGCGCG TATGATGATC
GAGCAAATGG GTTTCTCCAA GCGAATTGGT CAAATCGCCA TCAAGTCTGG CGGCGGTAAC
TCTTTCCTTG GCAACGACAT GGGTCGCGCC GCTGATTACT CCGCCGCCAC CGCCGCCATC
GTCGATGAAG AAGTCAAGAT CTTGGTCACT GCGGCCTACC GCCGCGCCAA GGACTTGGTT
CAATTGAACA TGGACGTCTT GCACGCCGTC GCGGACGTGT TGATGGAGAA GGAGAACATC
GACGGCGACG AATTCGAGCG CATCATGCTC GGCGCCAAGT CGGAGCTCTA CCTCAAGGCG
GACGAGCCTT CGGTCGCAGT GCCGTACCAA AACTGA
 
Protein sequence
MTNGAFAGAA RADAFNAAPT EQVQQQRGEA VSAFADASKE APAVNADGLP EGNNWRYSEF 
IKAVMSGKVE RVRFSKDGSA LQLTAVNGAR ATVILPNDPE LVDILAKNGV DISVSEGEQQ
GNAASLVGNL LFPLVAFGGL FFLFRRAQGG DGGMGGMGGM GGGPMDFGKS KSKFQEVPET
GVTFADVAGV EGAKLELQEV VDFLKNPDKY TALGAKIPKG CLLVGPPGTG KTLIAKAVAG
EAGVPFFSCA ASEFVELFVG VGASRVRDLF EKAKAKAPCI IFIDEIDAVG RQRGSGMGGG
NDEREQTINQ LLTEMDGFEG NTGVIVLAAT NRPDVLDSAL LRPGRFDRQV TVDRPDVAGR
IRILKVHARG KTLAKDVDFD KIARRTPGFT GADLENLMNE SAILAARREL TEISKEEIAD
ALERIIAGAA REGAVMSEKK KKLVAYHEAG HALVGALMPD YDAVTKISIV PRGNAGGLTF
FAPSEERLES GLYSRTYLEN QMAVAMGGRV AEELIFGAED VTTGASGDFQ QVTRTARMMI
EQMGFSKRIG QIAIKSGGGN SFLGNDMGRA ADYSAATAAI VDEEVKILVT AAYRRAKDLV
QLNMDVLHAV ADVLMEKENI DGDEFERIML GAKSELYLKA DEPSVAVPYQ N
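
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The sketch below shows one way to strip the whitespace, compute a GC fraction, and translate codons. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty; `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helper names. Only the first three blocks of the gene sequence are used as a demonstration.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
# Assumption: the record leaves the translation table blank.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First three blocks of the gene sequence, copied from this record.
    fragment = clean_sequence("ATGACGAACG GCGCGTTCGC CGGGGCGGCG")
    print(translate(fragment))  # prints MTNGAFAGAA
    print(gc_fraction(fragment))
```

The translated fragment matches the first block of the protein sequence listed above (MTNGAFAGAA). Applying the same functions to the full gene sequence would be the natural way to compare against the listed GC content and protein length.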