Gene OSTLU_43628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43628 
Symbol 
ID5006773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp331197 
End bp332138 
Gene Length942 bp 
Protein Length313 aa 
Translation table 
GC content58% 
IMG OID640422194 
Productpredicted protein 
Protein accessionXP_001422554 
Protein GI145356678 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01851] N-acetyl-gamma-glutamyl-phosphate reductase, uncommon form 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.100781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00250662 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGCGGG TGTTCATCGA TGGCGAGGCC GGAACCACGG GGCTTCAGGT GCGCGAACGG 
TTGGAGGCGC GAGGGGACGT CGAGCTGATT CAACTGGATG AGGGCTCGAG AAAGAATCTC
GAGGCGAGAC GCGCGGCGTT GAACGAGTGC GACGCGGCGA TTCTGTGCCT TCCCGATCAG
GCGGCGGAGG AGGCGGTGAA ATTGGTGGAG AATGAGACGA CGGTGGTGAT CGATGCGTCG
ACGGCTTTTC GCGTCGCCGA TGGTTGGACG TACGGATTCC CGGAGCTGGC GCCAGGGCAT
CGAGAGTTGG TCAAGGCGTC GAAGAGAATC TCAAATCCGG GGTGCTATCC CACCGGGTTC
ATCGCACTCA CTAGACCATT GGTTGACGCG GGCATCCTGT CTCCAGGCGC GGCGTTGACG
GTGAACGCGG TGAGCGGGTA CACGGGCGGC GGTAAGGCGC TCATCAAGGT GTACGAGGAG
GAAGAACACG AGCCGTGGGG CGCTTACGGA TTTAATCTCG AACACAAGCA CTTGCCAGAG
ATGGCGAAAT GGAGCATGAT CGGTCGTGAA CCAATTTTCA TGCCATCTGT CGGTTCCTTC
GCGCAAGGTA TGGTGGTGAG CGTACCGTTG CATTACGATC AACTTGCCGC CGACGCTCGC
AGCGCCAAGC GTCTGCATGA GTGCTTACGC GCGCGGTACG CGCAGAGTAC GTACGTTTCG
GTGCGAGATT TGAACAAGAT GGACGACCTC GAGCGTGGAG CTTTCATGAG ACCAGACTCT
TTGGCGAACA CGAACAAGCT CGAGTTAAGT GTTTACGCCA ACGACTCAAA GCGAACCGCC
GTTCTCGTGG CAAGGTTAGA TAATTTGGGC AAAGGTGCTT CGGGTGCGGC GGTGCAAAAC
ATGAACTTGG CGCTTGGACT GGATGAAACA ATGGGATTGT AG
 
Protein sequence
MKRVFIDGEA GTTGLQVRER LEARGDVELI QLDEGSRKNL EARRAALNEC DAAILCLPDQ 
AAEEAVKLVE NETTVVIDAS TAFRVADGWT YGFPELAPGH RELVKASKRI SNPGCYPTGF
IALTRPLVDA GILSPGAALT VNAVSGYTGG GKALIKVYEE EEHEPWGAYG FNLEHKHLPE
MAKWSMIGRE PIFMPSVGSF AQGMVVSVPL HYDQLAADAR SAKRLHECLR ARYAQSTYVS
VRDLNKMDDL ERGAFMRPDS LANTNKLELS VYANDSKRTA VLVARLDNLG KGASGAAVQN
MNLALGLDET MGL