Gene OSTLU_52011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_52011 
Symbol 
ID5006810 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp442844 
End bp444176 
Gene Length1333 bp 
Protein Length334 aa 
Translation table 
GC content65% 
IMG OID640422231 
Productpredicted protein 
Protein accessionXP_001422592 
Protein GI145356757 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0548] Acetylglutamate kinase 
TIGRFAM ID[TIGR00761] acetylglutamate kinase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.367344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.167765 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAATTTCCTC GCCGCGTCTC GTCGACGCGC GCGCCTCTTC CGTCCCGTCG ACGGGCCGCG 
TGAGAACTTA AACATGTCTG CCGCGTCGTG CGCGACGCGC GCGCGCGCGA GCGCGCGCGG
GGACGGCGCG CGGGGGGCGC GAAGATTCGC GACGCGTTCC AGGGTATGTT TTTCGACGCG
GCGACGCGGC GCCGAGGGCG CGCGCGCGGG ATCGAGCGCG CGCGGCGAAC GACGATGTCG
TCGCGATGCG CCGCGCGCGC GGGTCGCGCG CGACGGTGGC GCGCGCGCGA ACGCGCGGGC
GTCGGACTAG GTTTTGAACG CGGCGCGGCG ACGACGCGCC GAGGCGCGGG GAATACGATT
GGAGACGGCG CGCGATCGAC GGAAAGCGCG AGACTGACGA GCGGGTTCGC GATCGCAGAC
TTCAACCGCG CATCGCGCGC GCGCGATCGC GCTGAGCGCG GAACAGGAGA GCGATAATCA
AACGCGCGTC AAGGTGCTGT CGGAGGCGTT GCCGTATTTG CAACGCTTCG CGGGGCAAAC
CGTGGTGGTC AAGTACGGCG GCGCGGCGAT GAAGTCCGAG GAACTGAAGG CGGCGGTGAT
TCGTGATGTG GTGTTGCTGT CGACGGTGGG CATTCGGCCG GTGCTCGTGC ACGGGGGTGG
GCCAGAGATT AACGCGATGC TGAACAGGGT CGGCGTCGAG GCGAAGTTCT TAAACGGGCT
GCGAGTCACC GACGCGCAGA CGATGGAAAT CGTCGAGCAG GTGCTCACGG GTAAGGTGAA
CAAGTCCATC GTGAGCTTGA TTTCGTGCGC AGGCGGGAAA GCTGTCGGGA TCTGCGGCAA
GGATGGGAAT TTATTGCGCG GCGTGGTGAA GAGCGAGGAG TTGGGTTTCG TCGGCGACGT
GACGCAGGTG GACACGCGGT TGATTCGTGA ACTCGTGAAT GTGGGGTACA TTCCCGTCGT
CGCCACTGTC GCCATGGATG CAGACGGTCA GGCGCTTAAC GTGAACGCGG ATACCGCGGC
TGGTGCAATC GCGGCCAAGC TCGGGGCGGA GAAGCTCATC TTGATGACGG ACGTTCCGGG
CGTGTGCACC GATAAGGATG ATCCCAACAC TTTGATTCGC GAGCTGACCA TGAAGGAGAC
GGAAGAAGCC ATCGCGAAGG GTGTGATTGC GGGAGGCATG ATTCCCAAAG TCGAATGTTG
CATGACCAGC ATCACGAACG GCGTGAAGAG CGCGCACATC ATCGACGGTC GCGCCAAGCA
CAGCCTACTC CTCGAAATTC TCACCGACAC CGGCGTCGGC ACCGTCATCA CTTCCCCCGT
CGTCGCCGTA TAA
 
Protein sequence
MSAASCATRA RASARGDGAR GARRFATRSR TSTAHRARAI ALSAEQESDN QTRVKVLSEA 
LPYLQRFAGQ TVVVKYGGAA MKSEELKAAV IRDVVLLSTV GIRPVLVHGG GPEINAMLNR
VGVEAKFLNG LRVTDAQTME IVEQVLTGKV NKSIVSLISC AGGKAVGICG KDGNLLRGVV
KSEELGFVGD VTQVDTRLIR ELVNVGYIPV VATVAMDADG QALNVNADTA AGAIAAKLGA
EKLILMTDVP GVCTDKDDPN TLIRELTMKE TEEAIAKGVI AGGMIPKVEC CMTSITNGVK
SAHIIDGRAK HSLLLEILTD TGVGTVITSP VVAV