Gene Information |
Locus tag | OSTLU_52011 |
Symbol | |
ID | 5006810 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 442844 |
End bp | 444176 |
Gene Length | 1333 bp |
Protein Length | 334 aa |
Translation table | |
GC content | 65% |
IMG OID | 640422231 |
Product | predicted protein |
Protein accession | XP_001422592 |
Protein GI | 145356757 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0548] Acetylglutamate kinase |
TIGRFAM ID | [TIGR00761] acetylglutamate kinase |
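The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the reported gene length) and that the coding sequence carries one stop codon. The function names are illustrative and do not come from the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, has_stop_codon: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    return 3 * (protein_aa + (1 if has_stop_codon else 0))


gene_bp = span_length(442844, 444176)   # values from the record above
cds_bp = coding_length(334)             # 334 aa plus an assumed stop codon
noncoding_bp = gene_bp - cds_bp         # introns and any untranslated flank

print(gene_bp, cds_bp, noncoding_bp)    # 1333 1005 328
```

The 328 bp difference fits the "Is gene spliced | Yes" entry: part of the genomic span does not code for protein.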
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.367344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.167765 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAATTTCCTC GCCGCGTCTC GTCGACGCGC GCGCCTCTTC CGTCCCGTCG ACGGGCCGCG TGAGAACTTA AACATGTCTG CCGCGTCGTG CGCGACGCGC GCGCGCGCGA GCGCGCGCGG GGACGGCGCG CGGGGGGCGC GAAGATTCGC GACGCGTTCC AGGGTATGTT TTTCGACGCG GCGACGCGGC GCCGAGGGCG CGCGCGCGGG ATCGAGCGCG CGCGGCGAAC GACGATGTCG TCGCGATGCG CCGCGCGCGC GGGTCGCGCG CGACGGTGGC GCGCGCGCGA ACGCGCGGGC GTCGGACTAG GTTTTGAACG CGGCGCGGCG ACGACGCGCC GAGGCGCGGG GAATACGATT GGAGACGGCG CGCGATCGAC GGAAAGCGCG AGACTGACGA GCGGGTTCGC GATCGCAGAC TTCAACCGCG CATCGCGCGC GCGCGATCGC GCTGAGCGCG GAACAGGAGA GCGATAATCA AACGCGCGTC AAGGTGCTGT CGGAGGCGTT GCCGTATTTG CAACGCTTCG CGGGGCAAAC CGTGGTGGTC AAGTACGGCG GCGCGGCGAT GAAGTCCGAG GAACTGAAGG CGGCGGTGAT TCGTGATGTG GTGTTGCTGT CGACGGTGGG CATTCGGCCG GTGCTCGTGC ACGGGGGTGG GCCAGAGATT AACGCGATGC TGAACAGGGT CGGCGTCGAG GCGAAGTTCT TAAACGGGCT GCGAGTCACC GACGCGCAGA CGATGGAAAT CGTCGAGCAG GTGCTCACGG GTAAGGTGAA CAAGTCCATC GTGAGCTTGA TTTCGTGCGC AGGCGGGAAA GCTGTCGGGA TCTGCGGCAA GGATGGGAAT TTATTGCGCG GCGTGGTGAA GAGCGAGGAG TTGGGTTTCG TCGGCGACGT GACGCAGGTG GACACGCGGT TGATTCGTGA ACTCGTGAAT GTGGGGTACA TTCCCGTCGT CGCCACTGTC GCCATGGATG CAGACGGTCA GGCGCTTAAC GTGAACGCGG ATACCGCGGC TGGTGCAATC GCGGCCAAGC TCGGGGCGGA GAAGCTCATC TTGATGACGG ACGTTCCGGG CGTGTGCACC GATAAGGATG ATCCCAACAC TTTGATTCGC GAGCTGACCA TGAAGGAGAC GGAAGAAGCC ATCGCGAAGG GTGTGATTGC GGGAGGCATG ATTCCCAAAG TCGAATGTTG CATGACCAGC ATCACGAACG GCGTGAAGAG CGCGCACATC ATCGACGGTC GCGCCAAGCA CAGCCTACTC CTCGAAATTC TCACCGACAC CGGCGTCGGC ACCGTCATCA CTTCCCCCGT CGTCGCCGTA TAA
Protein sequence | MSAASCATRA RASARGDGAR GARRFATRSR TSTAHRARAI ALSAEQESDN QTRVKVLSEA LPYLQRFAGQ TVVVKYGGAA MKSEELKAAV IRDVVLLSTV GIRPVLVHGG GPEINAMLNR VGVEAKFLNG LRVTDAQTME IVEQVLTGKV NKSIVSLISC AGGKAVGICG KDGNLLRGVV KSEELGFVGD VTQVDTRLIR ELVNVGYIPV VATVAMDADG QALNVNADTA AGAIAAKLGA EKLILMTDVP GVCTDKDDPN TLIRELTMKE TEEAIAKGVI AGGMIPKVEC CMTSITNGVK SAHIIDGRAK HSLLLEILTD TGVGTVITSP VVAV
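The sequences above are shown in space-separated blocks of ten characters, so the spaces must be stripped before computing anything from them. The following is a minimal sketch of that clean-up plus a GC-fraction and stop-codon check. The helper names are hypothetical, and only short excerpts of the gene sequence are used here rather than the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard nuclear code, assumed here


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form a stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS


first_blocks = "CAATTTCCTC GCCGCGTCTC"   # first two blocks of the gene sequence
last_blocks = "CGTCGCCGTA TAA"           # final blocks of the gene sequence

print(len(clean_sequence(first_blocks)), gc_fraction(first_blocks))  # 20 0.6
print(ends_with_stop(last_blocks))                                   # True
```

Running `gc_fraction` on the full gene sequence is one way to compare against the reported GC content field.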