Gene OSTLU_37796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37796 
Symbol 
ID5005970 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp16780 
End bp18765 
Gene Length1986 bp 
Protein Length636 aa 
Translation table 
GC content63% 
IMG OID640421391 
Productpredicted protein 
Protein accessionXP_001421942 
Protein GI145355383 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGCG AGATCGCGAG ACGCGCGATC AAACGCGGCG GCGCGCGATG CGCGACGCAG 
CTCGCGCGCG CGACGTCGTC CGCGTCCGCG TCGTCGTCAT CGTCGCGCGC CGTCGCGTCA
CCGGCGTCGA CGACGCCGCG AAGGCGATTG ACGACGTCCC TGCGAGACGC GAACGCGCGA
GGACGACACG CGGAGGTGAT CGAGGCGTAT GAAAACGGCG CCGCGGTGCG CACGGAGGCG
AACACGGCGG AGTACCTGAA GGCGCTCGTG GCGCTCGATC GAGTGAACGA GAGCGCGTTG
GCGCGCGCGG TGCACAGAGG CGCGACGGCG GAGGCGGCGG CGACGACGGG CGCAATCGGT
GCGAGCGCGA CGGCGACGGA GAGCGATGCG GCGCCGAAGG GGATGTTGGC GTCGTTGGGG
TCGATCTTCG GCGCGAACGC GGGGGCGAGC GCGGCGAACG CGGCGCCCGC GGTGGCGGCG
CTCGGAAGCG AAAAGAACCC GTTGTACACG CAACAGCTCG AACCGACTTT CAAGGCGCAA
TTGTGGCGCA CGGTGCGCAC GCTGGGGACG GCGTTCATCG TGCTGAGCGG GATCGGGGCG
CTCTTGGAGG ATCGCGGAGG GATGAGCAAG GCGATTTTAG GCGGCGAAAG CGTCAAGCCG
CATCAAAACA CGCAGACGAC GACTTTCGAC GACGTCAAGG GCGTGGACGA GGCCAAGGCG
GAGTTGGTGG AGATCGTAGA GTACCTCAAG GCGCCGGAAA AGTTCACCAA ACTCGGCGGT
AAGTTACCCA AAGGCTTGCT TCTCGTCGGC CCGCCGGGAA CGGGGAAGAC GATGCTCGCC
AAGGCGGTCG CGGGCGAAGC GGGCGTGCCA TTCTTTTACA GCAGCGGTAG CGAGTTCGAA
GAGATGTTCG TCGGCGTCGG CGCGCGGAGA GTGCGAGATC TCTTCAAGGC GGCTAAGCAA
AACGCGCCGT GCATCGTTTT TATCGACGAA ATCGACGCCG TGGGGGCGGC GAGAAACCCT
AAGGACCAAC AAAACACTCG CATGACGTTG AACCAACTCT TGACCGAGCT CGATGGCTTT
AAAGCGAGCG AGGGCGTCAT CGTGCTCGCG GCCACGAACA CACCGGGGAT GTTGGACAAG
GCTTTGATTC GTCCAGGGCG ATTCGATCGC ACGGTGTCCG TGCCCAATCC CGACGTCGGC
GGCCGCCGCG AAATTTTACA GGCGCACGCC AAGGGCGTGA AGATGGCGGA TAATGTCGAC
TTCGACGTCG TCGCGCGCGG CACTCCCGGT TTCAGCGGCG CTGACTTGGC AAACTTGATA
AACATCGCCG CGCTTAAAGC CGCGCTCGAC GGCGTCGCGA GCGTCGGCGC CAAGCACCTC
GATTTCGCCA AGGATCGCAT CTTGATGGGC GCCGCGCGCA CATCAGCCAT CATCACGCCC
GAAAATCGCA AGTTGACGGC GTATCACGAA GGTGGGCACG CGTTGGTGGC GCTTCGCACG
AAGGGCGCGC GTCCGGTGCA CAAGGCGACC ATCGTTCCGC GAGGGCAAGC GTTGGGGATG
GTGATGCAAC TCCCCGAGAA GGACGAATTG CAAATGACGC GAAGACAACT GCTCGCCATG
CTCGACGTCA CCATGGGCGG TCGTGTGGCG GAGGAGCTCA TCTTTGGTTC CGAGGAGATC
ACCACCGGGG CTTCGAGCGA TTTACAGCAA GCCACCCGTC TGGCGCGAGA GATGGTGACG
CGCTACGGCA TGAGCGAAAA AGTCGGCTTG GCGTCGCAAG ACTACGCGTC CGATGAGTTG
TCGAGCGAAA CTCGACAGCT GATCGAGATC GAGGTGAAAG CGATGCTCGA CGCGGCGTAT
AAACGCGCGA AAGATTTACT CACTCAACAC GAGGGCGATT TGCACACGAT TGCGCGACGC
TTGCTGGACT CCGAGAGCTT GAGTGGAAGC GAGTTGAAGG AGCTTTGCGG AATAGCCACC
GCGTGA
 
Protein sequence
MLREIARRAI KRGGARCATQ LARATSSASA SSSSSRAVAS PASTTPRRRL TTSLRDANAR 
GRHAEVIEAY ENGAAVRTEA NTAEYLKALV ALDRVNESAL ARAVHRGATA EAAATTGAIG
ASATATESDA APKGMLASEK NPLYTQQLEP TFKAQLWRTV RTLGTAFIVL SGIGALLEDR
GGMSKAILGG ESVKPHQNTQ TTTFDDVKGV DEAKAELVEI VEYLKAPEKF TKLGGKLPKG
LLLVGPPGTG KTMLAKAVAG EAGVPFFYSS GSEFEEMFVG VGARRVRDLF KAAKQNAPCI
VFIDEIDAVG AARNPKDQQN TRMTLNQLLT ELDGFKASEG VIVLAATNTP GMLDKALIRP
GRFDRTVSVP NPDVGGRREI LQAHAKGVKM ADNVDFDVVA RGTPGFSGAD LANLINIAAL
KAALDGVASV GAKHLDFAKD RILMGAARTS AIITPENRKL TAYHEGGHAL VALRTKGARP
VHKATIVPRG QALGMVMQLP EKDELQMTRR QLLAMLDVTM GGRVAEELIF GSEEITTGAS
SDLQQATRLA REMVTRYGMS EKVGLASQDY ASDELSSETR QLIEIEVKAM LDAAYKRAKD
LLTQHEGDLH TIARRLLDSE SLSGSELKEL CGIATA