Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33128 |
Symbol | |
ID | 5003448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 434479 |
End bp | 435864 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | |
GC content | 61% |
IMG OID | 640418869 |
Product | predicted protein |
Protein accession | XP_001419163 |
Protein GI | 145349485 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) |
TIGRFAM ID | [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.510611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0211144 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGTC GCGGACGCGC GCGCGCGAGC GATCGTCGCG CGCGGGGCGC GGTGGTGGCT TCGGGGAAGA TTGATTTCAC GGACGCTGGG TGTTTCGAGC TGCCGTACGC GCCGGTGGCG GAGGCGTTGC TTCCGGAGGG GCCGTGGAAG GTTGTCGAAG GCGGCGTGTG CGCGGCGAAG GGATTTAAAG TCGCGGGATA TAAGGCGGGA TTGCGCGCCA AGGGCACGCG CGCCGACTGC GCGCTCATCG TCGCGGACGA AGACGCGACG TGCGCGGGGA TTTTTACGAC AAACATCATG TGCGCCGCGC CGGTGACGTA CTGTAAGAAA CAGTTGGCGG GGAAACCGAC GGCGCGAGCG CTGTTGATCA ACGCTGCACA AGCGAACGCC GCAACAGGTG ATCAGGGCGC TGCGGACGCG CAAGCGACCG CGGAGGAGTT GTCGAAATCC CTCGGCGTCG CCGAGGAGGA CATTTTACTC ATGTCCACGG GCGTCATTGG CAAGCGAATT AAGCTGGACA AACTCATGCC AGCGATTCCG ATTTTGTCCG CGAACGTGGA AAGTAGTACT GCGGCGGCAA ACGCGGCGGC GACCGCGATA TGCACCACCG ATCTCGTGCG AAAGACGGTC GCGATTGAAG TGCAAATTGG CGGTAAAACC GTTTGCATGG GCGGCATGGC CAAGGGGAGC GGGATGATTC ATCCAAATAT GGCGACCATG CTCGGCGTAG TGACGTGCGA TGCGGACGTG ACGCCCGAAG TTTGGCGTAA CATCACCTCC CGCGCCGGGG CGGCGTCTTT CAATCAAATC TCCGTCGATG GCGATACGAG CACGAACGAT TCTTTGGTGT GTTTCGCCAG TGGCAAAGCC GGCAACGCCA AAATCACGAG CGTCGATTCT GCCGAAGGCA AGCTTCTCGA ACAAGCACTC ACCGCGGTCT GCCGCGGTCT CGCCAAAGCC ATCGCTTGGG ACGGTGAAGG TGCGACGTGT TTGATCGAGT GCAACGTCTC TGGAGCCGCA GACGACGAAG ACGCGCGCGT CATTGCGCGT TCCGTCGTCT GTTCCTCGCT CGCTAAGGCA GCCATCTTCG GACACGATCC AAACTGGGGG CGATTGGCTT GCGCTGCGGG GTACGCCGCG CCGGTGAAGA ACAGATTCGA TCAAAATGAT CTCAAGCTCT CACTCGGCCC GCACCAGCTC ATGGATAAAG GTCAACCGCT CGATTTCGAC GCCGTCGCCG CCAGCCGATA CCTCAAGGAA GTCACCGGCG TTCACGGCAC GTGCGTCGTC GACATCTCCG TCGGCAACGG ATCTGGCCGA GGCCAAGCCT GGGGCTGCGA CTTATCGTAC GATTACGTCA AAATCAACGC CGAATACACG ACGTAG
|
Protein sequence | MTSRGRARAS DRRARGAVVA SGKIDFTDAG CFELPYAPVA EALLPEGPWK VVEGGVCAAK GFKVAGYKAG LRAKGTRADC ALIVADEDAT CAGIFTTNIM CAAPVTYCKK QLAGKPTARA LLINAAQANA ATGDQGAADA QATAEELSKS LGVAEEDILL MSTGVIGKRI KLDKLMPAIP ILSANVESST AAANAAATAI CTTDLVRKTV AIEVQIGGKT VCMGGMAKGS GMIHPNMATM LGVVTCDADV TPEVWRNITS RAGAASFNQI SVDGDTSTND SLVCFASGKA GNAKITSVDS AEGKLLEQAL TAVCRGLAKA IAWDGEGATC LIECNVSGAA DDEDARVIAR SVVCSSLAKA AIFGHDPNWG RLACAAGYAA PVKNRFDQND LKLSLGPHQL MDKGQPLDFD AVAASRYLKE VTGVHGTCVV DISVGNGSGR GQAWGCDLSY DYVKINAEYT T
|
| |