Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3412 |
Symbol | |
ID | 5077561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 13126 |
End bp | 14343 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640481136 |
Product | amidohydrolase |
Protein accession | YP_001165798 |
Protein GI | 146275638 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGC TCATCCGCGA CGTGCGCATT TTCGACGGCC AGACAATGCA AGCCGGCAAC CGTTCGGTCC TTGTCGAAGG CGGCCGCATC GCGGCCATTG GCGAAACCGC AGATGCGTTG GACAACGCGG GCGCGGAAAC TGTTGTCGAA GGCAAGGGTC GCACGCTCAT GCCCGGCATG GTCGAGGCCC ACGCGCATCT CACCTGGGCA TCCTCGGTCG AGAAGATCTA CCACCAGTTC ATCCTGCCGC CCGAAGAACT CAAAGTCGCG GCCTGGCGCA ATGCACGCGT CCTGCTCGAC CACGGCTTCA CCAGCGCCTA TTCAGCGGGC GCGCTGGGCG ATGGCATCGA GGTGGAGCTC GCCAAGGCCA TCGAAGCGGG CGAGACGCCG GGACCGCGCC TGGTCCCCTC CACTCTGGAA CGCAGCCCCG AAGGCGCCGA GGGCGTGGAG ACCGGCGACG TGTTCAACGG GCGTGGGCCC GACGCGATCC GCAAGTTCGT CACCTATTGC AAGGACCAGG GCATCGGCTC GCTCAAGCTG GTCGTGTCCG GCGAGGATGC GCTGAAGCCG GGATCGGCGG GCGACGTGCT CTACACCGAC GAGGAAATGG AAGCGGCCGG CGTCGCGGCG CGCGAAGCGG GCCTGTGGAT CGCCACCCAC GCCTATTACC CCAAGGCCAT CGAACTGGCG CTCAAGGCCG GCGCGCGCAT CATCTACCAC GCCTCGTATG CCGACGAGGC GGCGGCCGAC GCGATGGTCG CGGCAAAGGA CGCGACGTTC TATGCCCCCT CGCCCGGAGT CTCGGTTGCC GCGCTGGAAG CCACGCCCCC GCCGCACATC GACATGAGCC ACATGAAGAA AAGCGCGGCG GAGCGAATGG AACTTGAAAG CAGGCTCGTG CCCGCACTCA AGGCGCGCGG CGTGCGCATC CTGATCGGCG GAGACTATGG CTTTCCGTTC AATCCCAACG GCCGCAACGC CCGCGACCTC GAAATCTTCG TCGAACACTT CGCGTATACG CCCGCCGAGG CACTGACCGC CGCGACGAAG CTCGGCGGCG AACTGATGGG CATAGAGGTG GGACAGGTCC GCGAAGGCTA CCTGGCGGAC CTCCTGCTGG TCGATGGCGA TCCGACCCAG GACGTGGGGC TGCTCCAGGA CAAGAACCGG CTGGCCATGA TCATGAAGGG CGGCGCGATC TACAAGGCGG CAGCATGA
|
Protein sequence | MATLIRDVRI FDGQTMQAGN RSVLVEGGRI AAIGETADAL DNAGAETVVE GKGRTLMPGM VEAHAHLTWA SSVEKIYHQF ILPPEELKVA AWRNARVLLD HGFTSAYSAG ALGDGIEVEL AKAIEAGETP GPRLVPSTLE RSPEGAEGVE TGDVFNGRGP DAIRKFVTYC KDQGIGSLKL VVSGEDALKP GSAGDVLYTD EEMEAAGVAA REAGLWIATH AYYPKAIELA LKAGARIIYH ASYADEAAAD AMVAAKDATF YAPSPGVSVA ALEATPPPHI DMSHMKKSAA ERMELESRLV PALKARGVRI LIGGDYGFPF NPNGRNARDL EIFVEHFAYT PAEALTAATK LGGELMGIEV GQVREGYLAD LLLVDGDPTQ DVGLLQDKNR LAMIMKGGAI YKAAA
|
| |