Gene OSTLU_44098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_44098 
Symbol 
ID5004568 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp383927 
End bp385510 
Gene Length1584 bp 
Protein Length494 aa 
Translation table 
GC content56% 
IMG OID640419989 
Productpredicted protein 
Protein accessionXP_001420493 
Protein GI145352310 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0738436 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTTT TGGCGTACTC CGTCGGCGGC TTGGCGTTTA TGATCGGTTC GATTTACTTG 
GGCATCGTCC GTAGGACGTC GGTGCCGCAA GATCAGTTCC AGGCGATGCA GTTTGCGCAG
TCCCGCGCGG GTGCGAGGCG AGACGGCACG GTGGACGTGA CTTTGGAAGA CGTTGGTGGG
TTGGAGAACA TCATAGAAGA TTTGGAGGAA GTCGTCGCCT TTTTAAAGGA ACCCGAGCGT
TTCGCCAAGG TTGGCGCTAG GCCCCCCAAG GGTTTGCTCA TGGAAGGAGG CCCGGGCGTT
GGTAAGACGC TGATCGCGAA GGCGATCGCG GGTGAAGCCA AAGTGCCGTT CTACTCCATG
TCTGGCTCTG AGTTTGTTGA AATCATCGTC GGCGTCGGCG CGGCTCGAGT GAGAGATTTA
TTTAAACGCG CGCGCATCAA CGCGCCGTGC TTGATATTCG TCGACGAAAT CGATGCGTTA
GGCACGAAAC GCGCCGCTGC TGGCACGCGT GGCACGGAAG AGCACGAACA AACGCTGAAT
CAGCTTCTCA CGGAGATGGA TGGTTTCACG CCAGACACTG GCGTCGTCTT CATCGGCGCG
ACAAACCGAG CCGATTTACT CGATCCCGCG CTGTTGCGGC CAGGTCGATT CGATCGTAAA
GTTCGCGTCG GCCTTCCAAA CGTCGAGGCA CGGGCTAAGA TTTTGCAGAT TCACTTGTCC
AAGCGAAACT GCAACCCAGA AATTGACACC AAGCGATTGG CGCAGAACCT TCCCGGTTTG
TCGGGCGCCG AGATTGCAAA CATTTGCAAC GAAGCCGCGG TGCACTGCGT GCGACGCCAG
GGTGAACAGA TCGAAGAGCA CGACGTCTTG GATGCCGTTG AGCGCGTCGT GAGCGGCATT
CGTCTTACAG CACACCCCAA GGAAAGCGTG ACGACGCGCA AACTCGCGGC GCACGAAGTC
GGCCACGCCC TGGTCCAGAA CTTGCTCCAT AAGAGCAACG GTCTCATCGA AGACATCGAG
ATGATTTCCA TCATTCCGCG AGGTTTCGAG CCAGCGATTA CGCTGATCCA GAGAAAGCGT
GACGAGGATT ACCGATATCC TACGCGCGCG CGAATGTGCG AGCGCGTGCA AGTTTTGCTC
GCGGGCCGCT CCTGCGAAAA AGTGTTGTTT GGAGAGGCGA GCACTCGCGG TTCGGAAGAT
GTTTGCGAAG CGAATGATTT GCTTCGAAAT ATGATCGTAA ACTTTGGCTT GGGGCAGCCC
GGCATGATGA CGACGTACAC ATACGACCCG AAGCATTTGA ACAAGTCGGA GAGACGAGTG
GCGCGCTTGC AAGGTGCGGT GTCTAAATCT GGAGAGTTGA TGCCCTTGGA CGAACTGTTG
ACGATTGCAG GTCCGATCAG AGAGGTAGTC CCCGATCACT ATCAATACGC CGAGCGCAAG
ATGGTTCAGA TTCTTCAAGA AGCGGAGGCA AACTGTTTGG CGATCATCGC CGCGCACGAG
GACGCAGTCA ACGCGATGGT GGATCGCTTG ATTGAAAACG AGACGCTTTC TTTGGCCGAG
TTCGAAGAAA TCCTCGCGGC TCAT
 
Protein sequence
MEFLAYSVGG LAFMIGSIYL GIVRRTSVPQ DQFQAMQFAQ SRAGARRDGT VDVTLEDVGG 
LENIIEDLEE VVAFLKEPER FAKVGARPPK GLLMEGGPGV GKTLIAKAIA GEAKVPFYSM
SGSEFVEIIV GVGAARVRDL FKRARINAPC LIFVDEIDAL GTKRAAAGTR GTEEHEQTLN
QLLTEMDGFT PDTGVVFIGA TNRADLLDPA LLRPGRFDRK VRVGLPNVEA RAKILQIHLS
KRNCNPEIDT KRLAQNLPGL SGAEIANICN EAAVHCVRRQ GEQIEEHDVL DAVERVVSGI
RLTAHPKESV TTRKLAAHEV GHALVQNLLH KSNGLIEDIE MISIIPRGFE PAITLIQRKR
DEDYRYPTRA RMCERVQVLL AGRSCEKVLF GEASTRGSED VCEANDLLRN MIVNFGLGQP
GMMTTYTYDP KHLNKSERRV ARLQGAMVQI LQEAEANCLA IIAAHEDAVN AMVDRLIENE
TLSLAEFEEI LAAH