Gene OSTLU_33128 details


Gene Information

Locus tag: OSTLU_33128
Symbol:
ID: 5003448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 434479
End bp: 435864
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table:
GC content: 61%
IMG OID: 640418869
Product: predicted protein
Protein accession: XP_001419163
Protein GI: 145349485
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase)
TIGRFAM ID: [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase
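
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below is a minimal check, not part of the record itself. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(434479, 435864)
print(length_bp, protein_length(length_bp))  # prints: 1386 461
```

Both values agree with the Gene Length and Protein Length fields of the record.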


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.510611
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0211144
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGTC GCGGACGCGC GCGCGCGAGC GATCGTCGCG CGCGGGGCGC GGTGGTGGCT 
TCGGGGAAGA TTGATTTCAC GGACGCTGGG TGTTTCGAGC TGCCGTACGC GCCGGTGGCG
GAGGCGTTGC TTCCGGAGGG GCCGTGGAAG GTTGTCGAAG GCGGCGTGTG CGCGGCGAAG
GGATTTAAAG TCGCGGGATA TAAGGCGGGA TTGCGCGCCA AGGGCACGCG CGCCGACTGC
GCGCTCATCG TCGCGGACGA AGACGCGACG TGCGCGGGGA TTTTTACGAC AAACATCATG
TGCGCCGCGC CGGTGACGTA CTGTAAGAAA CAGTTGGCGG GGAAACCGAC GGCGCGAGCG
CTGTTGATCA ACGCTGCACA AGCGAACGCC GCAACAGGTG ATCAGGGCGC TGCGGACGCG
CAAGCGACCG CGGAGGAGTT GTCGAAATCC CTCGGCGTCG CCGAGGAGGA CATTTTACTC
ATGTCCACGG GCGTCATTGG CAAGCGAATT AAGCTGGACA AACTCATGCC AGCGATTCCG
ATTTTGTCCG CGAACGTGGA AAGTAGTACT GCGGCGGCAA ACGCGGCGGC GACCGCGATA
TGCACCACCG ATCTCGTGCG AAAGACGGTC GCGATTGAAG TGCAAATTGG CGGTAAAACC
GTTTGCATGG GCGGCATGGC CAAGGGGAGC GGGATGATTC ATCCAAATAT GGCGACCATG
CTCGGCGTAG TGACGTGCGA TGCGGACGTG ACGCCCGAAG TTTGGCGTAA CATCACCTCC
CGCGCCGGGG CGGCGTCTTT CAATCAAATC TCCGTCGATG GCGATACGAG CACGAACGAT
TCTTTGGTGT GTTTCGCCAG TGGCAAAGCC GGCAACGCCA AAATCACGAG CGTCGATTCT
GCCGAAGGCA AGCTTCTCGA ACAAGCACTC ACCGCGGTCT GCCGCGGTCT CGCCAAAGCC
ATCGCTTGGG ACGGTGAAGG TGCGACGTGT TTGATCGAGT GCAACGTCTC TGGAGCCGCA
GACGACGAAG ACGCGCGCGT CATTGCGCGT TCCGTCGTCT GTTCCTCGCT CGCTAAGGCA
GCCATCTTCG GACACGATCC AAACTGGGGG CGATTGGCTT GCGCTGCGGG GTACGCCGCG
CCGGTGAAGA ACAGATTCGA TCAAAATGAT CTCAAGCTCT CACTCGGCCC GCACCAGCTC
ATGGATAAAG GTCAACCGCT CGATTTCGAC GCCGTCGCCG CCAGCCGATA CCTCAAGGAA
GTCACCGGCG TTCACGGCAC GTGCGTCGTC GACATCTCCG TCGGCAACGG ATCTGGCCGA
GGCCAAGCCT GGGGCTGCGA CTTATCGTAC GATTACGTCA AAATCAACGC CGAATACACG
ACGTAG
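
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any per-base statistic such as GC content can be computed. The following is a minimal sketch of that step. The helper names are hypothetical, and the example uses only the first two blocks of the sequence rather than the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a contiguous DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = join_blocks("ATGACGAGTC GCGGACGCGC")
print(len(first_blocks), gc_percent(first_blocks))  # prints: 20 70.0
```

Running the same two functions over the full 1386 bp sequence gives a value that can be compared with the GC content field of the record.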
 
Protein sequence
MTSRGRARAS DRRARGAVVA SGKIDFTDAG CFELPYAPVA EALLPEGPWK VVEGGVCAAK 
GFKVAGYKAG LRAKGTRADC ALIVADEDAT CAGIFTTNIM CAAPVTYCKK QLAGKPTARA
LLINAAQANA ATGDQGAADA QATAEELSKS LGVAEEDILL MSTGVIGKRI KLDKLMPAIP
ILSANVESST AAANAAATAI CTTDLVRKTV AIEVQIGGKT VCMGGMAKGS GMIHPNMATM
LGVVTCDADV TPEVWRNITS RAGAASFNQI SVDGDTSTND SLVCFASGKA GNAKITSVDS
AEGKLLEQAL TAVCRGLAKA IAWDGEGATC LIECNVSGAA DDEDARVIAR SVVCSSLAKA
AIFGHDPNWG RLACAAGYAA PVKNRFDQND LKLSLGPHQL MDKGQPLDFD AVAASRYLKE
VTGVHGTCVV DISVGNGSGR GQAWGCDLSY DYVKINAEYT T
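
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The record leaves the translation table field empty, so the sketch below assumes the standard genetic code (NCBI translation table 1). The `translate` helper is hypothetical and handles only unambiguous A, C, G and T bases.

```python
from itertools import product

# Standard genetic code (assumed; the record does not state a table).
# Amino acids are listed in TCAG codon order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate block-formatted DNA in frame 1; trailing partial codons are dropped."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGACGAGTC GCGGACGCGC GCGCGCGAGC GATCGTCGCG CGCGGGGCGC GGTGGTGGCT"
print(translate(first_line))  # prints: MTSRGRARASDRRARGAVVA
print(translate("ACGTAG"))    # prints: T*
```

The first 60 bases translate to the first 20 residues of the protein sequence. The final six bases give a threonine followed by a stop codon, which matches the last residue of the protein.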