Gene Saro_0019 details

Gene Information

Locus tag: Saro_0019
Symbol:
ID: 3916061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 14577
End bp: 15884
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 67%
IMG OID: 640442744
Product: homoserine dehydrogenase
Protein accession: YP_495302
Protein GI: 87198045
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
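
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record states neither, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


gene_len = span_length(14577, 15884)
print(gene_len)                  # 1308
print(protein_length(gene_len))  # 435
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.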


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAAC CATTGCGCAT CGCGCTGGCC GGACTTGGCA CGGTAGGCGG CGGAGTGATC 
CGGCTGATCG AGGCGAACGC CGATCTGATC GCGCGCCGCG CGGGCCGGCC GATAGTCATT
ACCACCGTCA GCGCACGCAA TCGCGACAAG GACCGCGGCT TCGACGTGTC GCGCTATGCC
TGGGAAGACG ACATGGTCAT CCTCGGCGAG CGTCCTGACG TGGACGTCGT GGTCGAACTC
GTCGGCGGCG CCGATGGCCC CGCCTTGGCG CTCGCCCGGA CCACGTTCGA GGCCGGCAAG
GCTCTTGTCA CGGCCAACAA GGCAATGATC GCGCACCACG GCGTGGAACT TGCCACAAAG
GCAGAAGCCG CCAAGGTGGC GCTGAAATTC GAGGCTGCGG TCGCTGGCGG CATCCCCGTT
ATCAAGGGAC TCAAGGAAGG CGTCGCCGCC AACGAGATCG CACGGGTCTA TGGCATTCTC
AACGGCACCT GCAACTACAT CCTCTCGACG ATGGAAGACA CCGGCCGCGA TTTCGCCGAC
GTTCTCGCCG AGGCGCAGGC CAAGGGCTAT GCCGAAGCCG ACCCGACCTT CGACATCGAC
GGCATCGACG CCGCGCACAA GCTTTCGATC CTTTCGTCGA TCGCCTTCGG CACGGCGGTG
GACTTCAAGC CCGTGGCCGC GACCGGCATC CGCCGCGTCC TTGCCGCCGA CATCGCGCAG
GCAGATGTGC TCGGCTACTA TATCCGCCTG ATCGGCATGG CCGAGACGGA AATGGACGCT
GCGGGCAACC GCCGCCTGTT CCAGCGGGTC CACCCGCACC TCGTCCATCG CGACCATCCG
CTCGCCCATG TCGACGGCGC GACCAATGCG GTCGTCGCCG AGGGCAATTT CGTGGGCAGG
CTGCTGTTCC AGGGCGCGGG GGCCGGCGAT GGTCCGACCG CCAGCGCCGT GGTCGCCGAT
CTCATCGACA TCGCGCGCGG CGACATCGGC GCGCCCTTCT CGATCCCGGT CGCGGAACTG
GAAAGGGCAG CTCCGGCCGA AACCGGCCAC CGCAGGGGCA AGGCCTATAT CCGGTTCAAC
GTGGCCGATC GTCCGGGCGT GCTGGCCGAA ATCACCGCCG CCATGCGCGA CGCCGGGGTA
TCGATCGAGA GCTTCATCCA GAAGGGTGGG CAGGACGATG CACCGGTCAT GGTGTCGATG
GTCACGCACG AAGGCCCGGA AAGCGCCATC GCCGAAGCAC TGCGCCTTCT CGATGGCTCG
CCCGTCCTGG CCGAGCCGCC GCTGGTCATG CACATCCTCG GCGAATGA
 
Protein sequence
MSEPLRIALA GLGTVGGGVI RLIEANADLI ARRAGRPIVI TTVSARNRDK DRGFDVSRYA 
WEDDMVILGE RPDVDVVVEL VGGADGPALA LARTTFEAGK ALVTANKAMI AHHGVELATK
AEAAKVALKF EAAVAGGIPV IKGLKEGVAA NEIARVYGIL NGTCNYILST MEDTGRDFAD
VLAEAQAKGY AEADPTFDID GIDAAHKLSI LSSIAFGTAV DFKPVAATGI RRVLAADIAQ
ADVLGYYIRL IGMAETEMDA AGNRRLFQRV HPHLVHRDHP LAHVDGATNA VVAEGNFVGR
LLFQGAGAGD GPTASAVVAD LIDIARGDIG APFSIPVAEL ERAAPAETGH RRGKAYIRFN
VADRPGVLAE ITAAMRDAGV SIESFIQKGG QDDAPVMVSM VTHEGPESAI AEALRLLDGS
PVLAEPPLVM HILGE
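
The link between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is not the tool that produced this record. It builds the codon table from the usual TCAG ordering, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons it permits. The helper names are hypothetical, and only the first 30 and last 9 nucleotides of the gene sequence are used.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


first_30 = "ATGAGCGAAC CATTGCGCAT CGCGCTGGCC"  # start of the gene sequence
last_9 = "GGCGAATGA"                           # end of the gene sequence
print(translate(first_30))  # MSEPLRIALA
print(translate(last_9))    # GE
```

The two translated fragments match the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.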