Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43628 |
Symbol | |
ID | 5006773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 331197 |
End bp | 332138 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | |
GC content | 58% |
IMG OID | 640422194 |
Product | predicted protein |
Protein accession | XP_001422554 |
Protein GI | 145356678 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01851] N-acetyl-gamma-glutamyl-phosphate reductase, uncommon form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.100781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00250662 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGCGGG TGTTCATCGA TGGCGAGGCC GGAACCACGG GGCTTCAGGT GCGCGAACGG TTGGAGGCGC GAGGGGACGT CGAGCTGATT CAACTGGATG AGGGCTCGAG AAAGAATCTC GAGGCGAGAC GCGCGGCGTT GAACGAGTGC GACGCGGCGA TTCTGTGCCT TCCCGATCAG GCGGCGGAGG AGGCGGTGAA ATTGGTGGAG AATGAGACGA CGGTGGTGAT CGATGCGTCG ACGGCTTTTC GCGTCGCCGA TGGTTGGACG TACGGATTCC CGGAGCTGGC GCCAGGGCAT CGAGAGTTGG TCAAGGCGTC GAAGAGAATC TCAAATCCGG GGTGCTATCC CACCGGGTTC ATCGCACTCA CTAGACCATT GGTTGACGCG GGCATCCTGT CTCCAGGCGC GGCGTTGACG GTGAACGCGG TGAGCGGGTA CACGGGCGGC GGTAAGGCGC TCATCAAGGT GTACGAGGAG GAAGAACACG AGCCGTGGGG CGCTTACGGA TTTAATCTCG AACACAAGCA CTTGCCAGAG ATGGCGAAAT GGAGCATGAT CGGTCGTGAA CCAATTTTCA TGCCATCTGT CGGTTCCTTC GCGCAAGGTA TGGTGGTGAG CGTACCGTTG CATTACGATC AACTTGCCGC CGACGCTCGC AGCGCCAAGC GTCTGCATGA GTGCTTACGC GCGCGGTACG CGCAGAGTAC GTACGTTTCG GTGCGAGATT TGAACAAGAT GGACGACCTC GAGCGTGGAG CTTTCATGAG ACCAGACTCT TTGGCGAACA CGAACAAGCT CGAGTTAAGT GTTTACGCCA ACGACTCAAA GCGAACCGCC GTTCTCGTGG CAAGGTTAGA TAATTTGGGC AAAGGTGCTT CGGGTGCGGC GGTGCAAAAC ATGAACTTGG CGCTTGGACT GGATGAAACA ATGGGATTGT AG
|
Protein sequence | MKRVFIDGEA GTTGLQVRER LEARGDVELI QLDEGSRKNL EARRAALNEC DAAILCLPDQ AAEEAVKLVE NETTVVIDAS TAFRVADGWT YGFPELAPGH RELVKASKRI SNPGCYPTGF IALTRPLVDA GILSPGAALT VNAVSGYTGG GKALIKVYEE EEHEPWGAYG FNLEHKHLPE MAKWSMIGRE PIFMPSVGSF AQGMVVSVPL HYDQLAADAR SAKRLHECLR ARYAQSTYVS VRDLNKMDDL ERGAFMRPDS LANTNKLELS VYANDSKRTA VLVARLDNLG KGASGAAVQN MNLALGLDET MGL
|
| |