Gene Saro_3631 details


Gene Information

Locus tag: Saro_3631
Symbol:
ID: 5077779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 257419
End bp: 259161
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 65%
IMG OID: 640481354
Product: amidohydrolase 3
Protein accession: YP_001166016
Protein GI: 146275856
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3653] N-acyl-D-aspartate/D-glutamate deacylase
TIGRFAM ID:
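
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 257419
end_bp = 259161

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1743 bp
print(protein_length, "aa")  # 580 aa
```

Both results agree with the Gene Length and Protein Length fields.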


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGATA TTCTTATCAG GAACGGTACC GTCGTGGACG GGACCGGTGC TCCGGCGTTC 
AAGGCGGACG TGCGCGTGCG CGACGGCGTG ATCGCCGAGG TCGGCGAGAA CCTCAGCCCC
AATGGCGAGC GCGTTTTCGA CGCCAGCGGC TGTCACGTCA CCCCCGGCTT CATCGAAAGC
CACACCCACT ATGACGGCAC CATGTGGTGG CAGCCCGATC TCGATCCGCT GCCCGGCTAT
GGCGCGACGA CGATGATCCT CGGCAACTGC GGTTTCTCGC CCGCGCCGCT GCACAAGTAC
ATGCCCGCCC AGCGCGAGAT GATCGGCATC TTCTCGTTCT TCGAGGACAT CCCGGAAGGC
CCGTTCATGC AGAACCTGCC GTGGGACTGG AACAAGTGGT CGGAATACCG CGCCTCGGTG
GAACGCAACG TCAAGGTGCC GCTGAACTAC GCCGCCTATG TCGGCCACAT CGCCATCCGC
CTTGCCGCGA TGGGCGTCGA GGCATGGGAG CGCGAGGCGA CCGCCGAGGA AATCGCCAAG
ATGGCCGAAC TGCTCGACGA CGCGCTTGCC GCCGGCGCGC TCGGCATGTC CGACAACATG
CACGACCATG ACGGACAGGA CCGCCCGGTT CCCACGCTCA AGGCCAACGA CGCCGAGTTC
GAAGCCCTGT TCGACGTGAT GGAGCGCTAC CCCGGTTGCT GCTACCAGGT CATTGTCGAC
ACCTTCATGC GCATGACCGG CCCGGCGAAC CTCGAACGCC TGTCGAAGCT TCTCGCCGGT
CGCAAGATCA AGGTGCAGAT CGCGGGCGCC ATCCCCACGC TTGAATTCCA GAAGGGCATC
CTGCCCGCGA TGCAGGAATC GGTGCGCAAG ATGCGCGAAG CCGGCGTCGA CGTGTGGCCC
GGCTATGCCC ACGTCTCGCC GACCTCGACG CTCAGCCTCG TCAAGTCGCT GATCTTCGCG
CAGTCGAACG ACTACGTCTG GCACGAAGTC GTGCTCGAGG ACGACCATGC CAAGAAGGCG
GCGCTTCTCG CCGATCCGGA ATGGCGCGCC CGTGCCCGCG AAAGCTGGGA TACCCAGGCG
TGGGATCATT CGCCGCTGAA GAACCCGCAG GAACTGTTCC TGCTCGACAG CGAGAACGGC
GCGGGTCCGC TCAACATCAC GCTCAAGGAA TATGCCGACA GCCTCGGCCT GCACCGTTCG
GACGCGATGG CGGACTGGAT CCTCAAGAAC GGCACCCGTT CGACCGTGCA TATGGCGCCC
TTCCCCAAGG ACGAGGCACT GACGCTGGAA CTGATGAAGG ACCCGAAGAC CGTCGGCAAC
ATCTCGGACG CCGGCGCGCA CCTTCAGATG CTTTGCGGTG GCGGCGAGAA CGCGCTGCTG
CTGACCCAGT ACGTCCGCGA GGAAAAGAAG CTTTCGCTGG AACAGGCGAT CCACGTGATG
ACCGGCAAGC TGGCCGGCCA CTTCAACCTC AATGACCGCG GCGTGATCGC GGTGGGCAAG
CGCGCCGACA TCGCCGTGTT CAACATGGAC GAGATCCAGC GTCGCGAGAT GGAAAAGGCC
TTCGACGTTC CCGACGGCCG CGGCGGCACC ACCTGGCGCT TTACCCGTCA GGCGATGCCC
ACCCGCCTCA CCCTGGTGAA CGGCGTTCCG ACTTTCGAGA ACGGCGCCTT CACCGGTGCG
ATGCCGGGCA AGTTCCTCTC CCCCGCGAAC GATGACGCGG CGCTGGCGGA GGCTGCGGAA
TAA
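
The GC content field can be recomputed from the blocked sequence above. This is a minimal sketch, assuming the percentage is defined as the count of G and C bases divided by the total number of bases; `gc_percent` is a hypothetical helper, not a function of the source database.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Return the G+C percentage of a sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used to block the sequence in tens.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used as a small example input.
first_line = "ATGTCCGATA TTCTTATCAG GAACGGTACC GTCGTGGACG GGACCGGTGC TCCGGCGTTC"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of `first_line` gives the value to compare with the GC content field.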
 
Protein sequence
MSDILIRNGT VVDGTGAPAF KADVRVRDGV IAEVGENLSP NGERVFDASG CHVTPGFIES 
HTHYDGTMWW QPDLDPLPGY GATTMILGNC GFSPAPLHKY MPAQREMIGI FSFFEDIPEG
PFMQNLPWDW NKWSEYRASV ERNVKVPLNY AAYVGHIAIR LAAMGVEAWE REATAEEIAK
MAELLDDALA AGALGMSDNM HDHDGQDRPV PTLKANDAEF EALFDVMERY PGCCYQVIVD
TFMRMTGPAN LERLSKLLAG RKIKVQIAGA IPTLEFQKGI LPAMQESVRK MREAGVDVWP
GYAHVSPTST LSLVKSLIFA QSNDYVWHEV VLEDDHAKKA ALLADPEWRA RARESWDTQA
WDHSPLKNPQ ELFLLDSENG AGPLNITLKE YADSLGLHRS DAMADWILKN GTRSTVHMAP
FPKDEALTLE LMKDPKTVGN ISDAGAHLQM LCGGGENALL LTQYVREEKK LSLEQAIHVM
TGKLAGHFNL NDRGVIAVGK RADIAVFNMD EIQRREMEKA FDVPDGRGGT TWRFTRQAMP
TRLTLVNGVP TFENGAFTGA MPGKFLSPAN DDAALAEAAE
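
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the gene sequence is given 5' to 3' in the coding direction and starts in frame. Translation table 11 assigns codons to amino acids in the same way as the standard genetic code and differs in which start codons are permitted, so this sketch does not handle alternative start codons. `translate` and `CODON_TABLE` are hypothetical names used for illustration.

```python
from itertools import product

# Codon assignments of the standard genetic code, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(blocked_sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGTCCGATA TTCTTATCAG GAACGGTACC"))  # MSDILIRNGT
```

The output matches the first ten residues of the protein sequence above.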