Gene Saro_3412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3412 
Symbol 
ID5077561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp13126 
End bp14343 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content66% 
IMG OID640481136 
Productamidohydrolase 
Protein accessionYP_001165798 
Protein GI146275638 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGC TCATCCGCGA CGTGCGCATT TTCGACGGCC AGACAATGCA AGCCGGCAAC 
CGTTCGGTCC TTGTCGAAGG CGGCCGCATC GCGGCCATTG GCGAAACCGC AGATGCGTTG
GACAACGCGG GCGCGGAAAC TGTTGTCGAA GGCAAGGGTC GCACGCTCAT GCCCGGCATG
GTCGAGGCCC ACGCGCATCT CACCTGGGCA TCCTCGGTCG AGAAGATCTA CCACCAGTTC
ATCCTGCCGC CCGAAGAACT CAAAGTCGCG GCCTGGCGCA ATGCACGCGT CCTGCTCGAC
CACGGCTTCA CCAGCGCCTA TTCAGCGGGC GCGCTGGGCG ATGGCATCGA GGTGGAGCTC
GCCAAGGCCA TCGAAGCGGG CGAGACGCCG GGACCGCGCC TGGTCCCCTC CACTCTGGAA
CGCAGCCCCG AAGGCGCCGA GGGCGTGGAG ACCGGCGACG TGTTCAACGG GCGTGGGCCC
GACGCGATCC GCAAGTTCGT CACCTATTGC AAGGACCAGG GCATCGGCTC GCTCAAGCTG
GTCGTGTCCG GCGAGGATGC GCTGAAGCCG GGATCGGCGG GCGACGTGCT CTACACCGAC
GAGGAAATGG AAGCGGCCGG CGTCGCGGCG CGCGAAGCGG GCCTGTGGAT CGCCACCCAC
GCCTATTACC CCAAGGCCAT CGAACTGGCG CTCAAGGCCG GCGCGCGCAT CATCTACCAC
GCCTCGTATG CCGACGAGGC GGCGGCCGAC GCGATGGTCG CGGCAAAGGA CGCGACGTTC
TATGCCCCCT CGCCCGGAGT CTCGGTTGCC GCGCTGGAAG CCACGCCCCC GCCGCACATC
GACATGAGCC ACATGAAGAA AAGCGCGGCG GAGCGAATGG AACTTGAAAG CAGGCTCGTG
CCCGCACTCA AGGCGCGCGG CGTGCGCATC CTGATCGGCG GAGACTATGG CTTTCCGTTC
AATCCCAACG GCCGCAACGC CCGCGACCTC GAAATCTTCG TCGAACACTT CGCGTATACG
CCCGCCGAGG CACTGACCGC CGCGACGAAG CTCGGCGGCG AACTGATGGG CATAGAGGTG
GGACAGGTCC GCGAAGGCTA CCTGGCGGAC CTCCTGCTGG TCGATGGCGA TCCGACCCAG
GACGTGGGGC TGCTCCAGGA CAAGAACCGG CTGGCCATGA TCATGAAGGG CGGCGCGATC
TACAAGGCGG CAGCATGA
 
Protein sequence
MATLIRDVRI FDGQTMQAGN RSVLVEGGRI AAIGETADAL DNAGAETVVE GKGRTLMPGM 
VEAHAHLTWA SSVEKIYHQF ILPPEELKVA AWRNARVLLD HGFTSAYSAG ALGDGIEVEL
AKAIEAGETP GPRLVPSTLE RSPEGAEGVE TGDVFNGRGP DAIRKFVTYC KDQGIGSLKL
VVSGEDALKP GSAGDVLYTD EEMEAAGVAA REAGLWIATH AYYPKAIELA LKAGARIIYH
ASYADEAAAD AMVAAKDATF YAPSPGVSVA ALEATPPPHI DMSHMKKSAA ERMELESRLV
PALKARGVRI LIGGDYGFPF NPNGRNARDL EIFVEHFAYT PAEALTAATK LGGELMGIEV
GQVREGYLAD LLLVDGDPTQ DVGLLQDKNR LAMIMKGGAI YKAAA