Gene Saro_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1086 
Symbol 
ID3916382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1128972 
End bp1129958 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content68% 
IMG OID640443821 
Producthomoserine kinase 
Protein accessionYP_496365 
Protein GI87199108 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR00938] homoserine kinase, Neisseria type 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGTCT ATACCCAGAT CGGCGCCGAG GACATGGCCG CGCTCGTCGC CGAGTTCGAC 
GTGGGCGAAC TCGTTTCGGC CAAGGGCATC GCCGAGGGCG TGTCGAACAG CAACTGGCTG
CTCGACACCA CCGGGCGCGA CGGCAAGGGC GCGCGGTTCA TCCTGACGAT GTACGAATTC
CGCATCGAGC TGGAAGACCT GCCGTATTTC CTCTCGCTGC TCGATCACCT TGCTGGGCGC
GGCTGCGCGG TGCCGCGCAC GATCCACGAC CGCGCAGGCG CGCTCTACCG CATGCGCGGG
GACAAGGCGC TGGCGCTCAT CGAGTTCCTG CCCGGCGTCT CGGTCAGCGA GCCGACCCCG
GCGCAGGCCC GCGCCGTGGG CACGGCGCTG GCGCAGATGC ACCTGGCCTC TGCCGACTTC
GCCGGCTCGC GAGAAAACGG GATGGGACTG GCGGAATGGC AGCGTCTGTT CGATGCCTGC
GGGGCGGAAG GGCTGGCACG GATCGACCCC GACCTTGCCG GACTGGTGGC AGAACACATG
CCCCGCATCG CGGCGCAATG GCCCGCCGAC CTGCCGCGCT CGGTCATCCA TGCGGACCTC
TTCCCCGACA ACGTGCTGAT GCTGGGCGAC AAGGTCACCG GCCTCATCGA CTTCTACTTC
GCCTGCAACG ACATCATGGC CTACGATGTG GCGGTCACCC ATGCGGCGTG GTGCTTCGAC
GGCAGCGGGC GGAGCTTCGA TCCGGCCGTC TCGGCGGCGC TGCTCGAAGG CTACGAGTCG
GTGCGGCCAC TGCTGCCGGA AGAGCGCGCG GCCCTGCCGC TGCTGGCGCA GGGCGCGGCG
ATGCGCTTCA CATCGAGCCG GGCCTATGAC TGGCTGAATA CGCCGGCCGA CGCGCTGGTG
GTGCGCAAGG ACCCGATGGC GTTCGCCCGG CGGCTGCAAT TCTACGCCGC CAATCCCGCC
ATATTCGACA CGAATGCTTT CGCGTGA
 
Protein sequence
MAVYTQIGAE DMAALVAEFD VGELVSAKGI AEGVSNSNWL LDTTGRDGKG ARFILTMYEF 
RIELEDLPYF LSLLDHLAGR GCAVPRTIHD RAGALYRMRG DKALALIEFL PGVSVSEPTP
AQARAVGTAL AQMHLASADF AGSRENGMGL AEWQRLFDAC GAEGLARIDP DLAGLVAEHM
PRIAAQWPAD LPRSVIHADL FPDNVLMLGD KVTGLIDFYF ACNDIMAYDV AVTHAAWCFD
GSGRSFDPAV SAALLEGYES VRPLLPEERA ALPLLAQGAA MRFTSSRAYD WLNTPADALV
VRKDPMAFAR RLQFYAANPA IFDTNAFA