Gene OSTLU_38040 details


Gene Information

Locus tag: OSTLU_38040
Symbol:
ID: 5004062
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 192915
End bp: 194288
Gene Length: 1374 bp
Protein Length: 437 aa
Translation table:
GC content: 56%
IMG OID: 640419483
Product: predicted protein
Protein accession: XP_001420108
Protein GI: 145351488
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
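
The Gene Length value is consistent with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check, using a hypothetical helper name (gene_length) that is not part of any database tool:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
length = gene_length(192915, 194288)
print(length)  # 1374, matching the Gene Length field
```

With 0-based, half-open coordinates the same span would be computed as end minus start, without the added 1.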


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.990763
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00560331
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAGAAGT ATCCTCGCTG GCTCGAGATT ACGGCGTTCC TGTTCAACGC GTTGACGCCG 
ATGGTGATGG TGTTCTATGC GTGGCTTATT TACGAAGGCA CGTACAAGGA TAGCTCGGAA
GACATGTTCG GTAACATGAC TACGCGTAAC TACGACTCGA ACGTTCGCCA AGGGATGACG
TTGAAGGACA TCACCGGCAT CGACAACGTC AAGGCGGAAA TGTTTGAACT CATTTCCTAC
TTGAAGGATT TCGAAAAGTA CAACTCCATG GGCGCGCGCA TTCCCGCAGG CGTGCTTTTG
TGCGGTCCGC CCGGTACGGG TAAGACGTTG CTCGCTCGTT GTGTCGCGGG CGAGGCAAAT
GTGCCCTTCT TCTCATGCGC TGGTACGGAG TTTATGGAGA TGTTCGTCGG CGTCGGAGCC
GCGCGTATTC GCAACTTGTT TGATCAAGCC AAGAAGGTTG CGCCGTGCAT CATCTTCATC
GATGAATTCG ATGCCGTCGG CACGAAGCGT ACTGAGACAC AATCCGGTCA GGTTTACGGT
AACGACGAAG CGACGGCGAC GATCAATCAA ATGCTGACGG AGATGGACGG TTTCTCGACC
GCCACGGGCA TCATGGTGTT GGCGGCGACG AACCGTCCGC AAGTACTCGA TCCTGCGCTC
ATTCGTGCCG GTCGTTTTGA TCGCATCATC GAGATGGGTC TGCCGAACAA GAAGTCGCGT
CAAGAAATCT TGTTCTTGCA CTGCAACAAG CCATCGTTCG CGTCGAGTGT CGATCCCAAC
TTGGACTACG AGTACCTCGC CAGACAAACT GCCGGTTTCA GCGGCGCCGA CATCGAGAAC
CTCACCAAGT CGGCCGTCAT GCGCTGTGCG CAAGGCGAGA AGGCGCTCGC GTCTACGGGT
GACTTCTTGT TTTGCATCGA CGATATTCGA CGATCGCAAG CGTTCGTTCG CAACGGAAGT
GGCAGCGGAA GTTTGGCTCG CGATAGAATG CTCGAAGATA CCCTCATCGC GCAACTCGAC
GCGTACGAAC GAGACTCGGT GGTGAATTAT TACGCCGCGC AGGCCGTGGT CGCGATGCAC
ATGCCTTCGT ACGACGAGAT TAGCAAGGTG ACGGTGTTTA ACGGTGGTGT AGCCACGGGT
CAAATCGTCT ACGTGCCGGA TGAAGTCGAC TCTCCCGCGG CTCGCACGGT GCGCTCCATG
GAGTATTACG AAGCCAAGCT GTGCGTACTC CTCGCAGGCC AGATGGCGGA GCGTTATTTG
TACGGTCCCG AGAACGTAAC GACCCGCGGC ATGCACGATG TCGCCGCTGC GACGAATTTG
GCGTGCGAAA TGGTGATGCA GAACGGGTGG AGTGATTTAG GTCCCATCGC CCTC
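
The GC content field can in principle be recomputed from the gene sequence once the spaces between the 10-base groups and the line breaks are removed. The following Python sketch shows one way to do that. The function name gc_content is hypothetical, and the database's own rounding or handling of ambiguous bases is not documented in this record, so the result may differ slightly from the listed value.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used to group bases) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence from this record, grouping spaces included.
# Pass the full sequence text to compare against the GC content field.
first_line = "ATGGAGAAGT ATCCTCGCTG GCTCGAGATT ACGGCGTTCC TGTTCAACGC GTTGACGCCG"
print(f"{gc_content(first_line):.1f}%")
```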
 
Protein sequence
MEKYPRWLEI TAFLFNALTP MVMVFYAWLI YEGTYKDSSE DMFGNMTTRN YDSNVRQGMT 
LKDITGIDNV KAEMFELISY LKDFEKYNSM GARIPAGVLL CGPPGTGKTL LARCVAGEAN
VPFFSCAGTE FMEMFVGVGA ARIRNLFDQA KKVAPCIIFI DEFDAVGTKR TETQSGQVYG
NDEATATINQ MLTEMDGFST ATGIMVLAAT NRPQVLDPAL IRAGRFDRII EMGLPNKKSR
QEILFLHCNK PSFASSVDPN LDYEYLARQT AGFSGADIEN LTKSAVMRCA QGEKALASTG
DFFLARDRML EDTLIAQLDA YERDSVVNYY AAQAVVAMHM PSYDEISKVT VFNGGVATGQ
IVYVPDEVDS PAARTVRSME YYEAKLCVLL AGQMAERYLY GPENVTTRGM HDVAAATNLA
CEMVMQNGWS DLGPIAL