Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37796 |
Symbol | |
ID | 5005970 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 16780 |
End bp | 18765 |
Gene Length | 1986 bp |
Protein Length | 636 aa |
Translation table | |
GC content | 63% |
IMG OID | 640421391 |
Product | predicted protein |
Protein accession | XP_001421942 |
Protein GI | 145355383 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGCG AGATCGCGAG ACGCGCGATC AAACGCGGCG GCGCGCGATG CGCGACGCAG CTCGCGCGCG CGACGTCGTC CGCGTCCGCG TCGTCGTCAT CGTCGCGCGC CGTCGCGTCA CCGGCGTCGA CGACGCCGCG AAGGCGATTG ACGACGTCCC TGCGAGACGC GAACGCGCGA GGACGACACG CGGAGGTGAT CGAGGCGTAT GAAAACGGCG CCGCGGTGCG CACGGAGGCG AACACGGCGG AGTACCTGAA GGCGCTCGTG GCGCTCGATC GAGTGAACGA GAGCGCGTTG GCGCGCGCGG TGCACAGAGG CGCGACGGCG GAGGCGGCGG CGACGACGGG CGCAATCGGT GCGAGCGCGA CGGCGACGGA GAGCGATGCG GCGCCGAAGG GGATGTTGGC GTCGTTGGGG TCGATCTTCG GCGCGAACGC GGGGGCGAGC GCGGCGAACG CGGCGCCCGC GGTGGCGGCG CTCGGAAGCG AAAAGAACCC GTTGTACACG CAACAGCTCG AACCGACTTT CAAGGCGCAA TTGTGGCGCA CGGTGCGCAC GCTGGGGACG GCGTTCATCG TGCTGAGCGG GATCGGGGCG CTCTTGGAGG ATCGCGGAGG GATGAGCAAG GCGATTTTAG GCGGCGAAAG CGTCAAGCCG CATCAAAACA CGCAGACGAC GACTTTCGAC GACGTCAAGG GCGTGGACGA GGCCAAGGCG GAGTTGGTGG AGATCGTAGA GTACCTCAAG GCGCCGGAAA AGTTCACCAA ACTCGGCGGT AAGTTACCCA AAGGCTTGCT TCTCGTCGGC CCGCCGGGAA CGGGGAAGAC GATGCTCGCC AAGGCGGTCG CGGGCGAAGC GGGCGTGCCA TTCTTTTACA GCAGCGGTAG CGAGTTCGAA GAGATGTTCG TCGGCGTCGG CGCGCGGAGA GTGCGAGATC TCTTCAAGGC GGCTAAGCAA AACGCGCCGT GCATCGTTTT TATCGACGAA ATCGACGCCG TGGGGGCGGC GAGAAACCCT AAGGACCAAC AAAACACTCG CATGACGTTG AACCAACTCT TGACCGAGCT CGATGGCTTT AAAGCGAGCG AGGGCGTCAT CGTGCTCGCG GCCACGAACA CACCGGGGAT GTTGGACAAG GCTTTGATTC GTCCAGGGCG ATTCGATCGC ACGGTGTCCG TGCCCAATCC CGACGTCGGC GGCCGCCGCG AAATTTTACA GGCGCACGCC AAGGGCGTGA AGATGGCGGA TAATGTCGAC TTCGACGTCG TCGCGCGCGG CACTCCCGGT TTCAGCGGCG CTGACTTGGC AAACTTGATA AACATCGCCG CGCTTAAAGC CGCGCTCGAC GGCGTCGCGA GCGTCGGCGC CAAGCACCTC GATTTCGCCA AGGATCGCAT CTTGATGGGC GCCGCGCGCA CATCAGCCAT CATCACGCCC GAAAATCGCA AGTTGACGGC GTATCACGAA GGTGGGCACG CGTTGGTGGC GCTTCGCACG AAGGGCGCGC GTCCGGTGCA CAAGGCGACC ATCGTTCCGC GAGGGCAAGC GTTGGGGATG GTGATGCAAC TCCCCGAGAA GGACGAATTG CAAATGACGC GAAGACAACT GCTCGCCATG CTCGACGTCA CCATGGGCGG TCGTGTGGCG GAGGAGCTCA TCTTTGGTTC CGAGGAGATC ACCACCGGGG CTTCGAGCGA TTTACAGCAA GCCACCCGTC TGGCGCGAGA GATGGTGACG CGCTACGGCA TGAGCGAAAA AGTCGGCTTG GCGTCGCAAG ACTACGCGTC CGATGAGTTG TCGAGCGAAA CTCGACAGCT GATCGAGATC GAGGTGAAAG CGATGCTCGA CGCGGCGTAT AAACGCGCGA AAGATTTACT CACTCAACAC GAGGGCGATT TGCACACGAT TGCGCGACGC TTGCTGGACT CCGAGAGCTT GAGTGGAAGC GAGTTGAAGG AGCTTTGCGG AATAGCCACC GCGTGA
|
Protein sequence | MLREIARRAI KRGGARCATQ LARATSSASA SSSSSRAVAS PASTTPRRRL TTSLRDANAR GRHAEVIEAY ENGAAVRTEA NTAEYLKALV ALDRVNESAL ARAVHRGATA EAAATTGAIG ASATATESDA APKGMLASEK NPLYTQQLEP TFKAQLWRTV RTLGTAFIVL SGIGALLEDR GGMSKAILGG ESVKPHQNTQ TTTFDDVKGV DEAKAELVEI VEYLKAPEKF TKLGGKLPKG LLLVGPPGTG KTMLAKAVAG EAGVPFFYSS GSEFEEMFVG VGARRVRDLF KAAKQNAPCI VFIDEIDAVG AARNPKDQQN TRMTLNQLLT ELDGFKASEG VIVLAATNTP GMLDKALIRP GRFDRTVSVP NPDVGGRREI LQAHAKGVKM ADNVDFDVVA RGTPGFSGAD LANLINIAAL KAALDGVASV GAKHLDFAKD RILMGAARTS AIITPENRKL TAYHEGGHAL VALRTKGARP VHKATIVPRG QALGMVMQLP EKDELQMTRR QLLAMLDVTM GGRVAEELIF GSEEITTGAS SDLQQATRLA REMVTRYGMS EKVGLASQDY ASDELSSETR QLIEIEVKAM LDAAYKRAKD LLTQHEGDLH TIARRLLDSE SLSGSELKEL CGIATA
|
| |