Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2823 |
Symbol | |
ID | 3916983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3046851 |
End bp | 3048119 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640445602 |
Product | amidohydrolase |
Protein accession | YP_498093 |
Protein GI | 87200836 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.750141 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCGCT ATCTGCTTCC GCTTGTCGCC CTGATCCCTG CAACCGCCAG TGCCGAGACG CTTTACGTGC GTGCCGGGCG GCTGGTCGAC CCCGAGGCGG GCAAGGTGCT GACGGGCCAG GTGCTCACGG TCGAGGACGG GCGGGTTGTT TCGGTGCAGG GGGACGGGCC GTTGCCGCAA GGGGCGAAGG TGGTGGACTG GTCGGGGTTT ACGGTCCTGC CGGGGCTGAT CGATTGCCAT GTGCATCTGG CCGACGTGGA GCAATCGAAC AACGTGGCCG AGCCGCTGCT CCATTCGGCG ATGGAGATCG GGTTCATCGG TGCGAGGAAT GCGAAGAAAA CGCTGCTGGC CGGTTTCACC ACGGTCCACG ATGTCGGGAG TTTCCGCGCC TATGCCGACG TCGAACTGCG CAATGCGATC AACCGGGGAG ATGTGCCGGG CCCGCGGGTG AGCGCGGTGG GGGCCTATGT TACGATCCCC GGCGGCGGGG GCGAGGTGAC CGGGTTCGCG CCCGACGTGA CGGTGCCGGC CGACATGCGC GCGGGCGTCG TGAACGATGC GGCCGACGTG ACCCGCAAGG TCAATGCGCT GTTCCAGAAC GGCGCGGATT CGATCAAGTT GATCGCGACG GGCGCGGTGC TGGCGCAGGG GACCGAGCCG GGGCAGATCG AGCTTTCGGG CGAGATGATG AAGGCGGCGG TGGACGTTGC CCGCCAGCGC GGATCATGGG TGACCGCCCA TGCCCATGGT GCGGCGGGGA TCAAGCTTGC CATCCAGTCC GGCGTCAAGG CGATCGAGCA TGCGAGCCTG ATCGACGACG AAGGGATCGC GCTGGCCAGG GCGCGGGGCG TGTGGCTCGA CATGGATATC TACAACGGCG ACTTCATTGC CGAGGTGGGC AAGCGCGATG GCTGGCCCGC CGACATGCTG CGCAAGAACG ACGAGACGAC CGAAGCGCAG CGGGAGGGCT TTCGCAAGGC GGTGAAGGCG GGCGTCAGGT TGAGCTACGG CACGGATGCG GGGGTCTTTC CGCACGGGCT GAACGCGCGA CAGTTCAGGT ACATGGTGCG ATACGGGATG ACGCCGATGC AGGCGATCCA GTCTGCGACG ACCGTGGCGG CCGATCTGCT GGGGTGGAGC CGCGACGTGG GCGCGCTGTC GCCCGGCCAC TATGCCGACA TGATCGCGGT GGCGGGCGAT CCGCTGGCCG ACGTGAGCGT GCTGGAGAAC GTCGCGCACG TGATGAAGGG CGGCGAGGTG GTGCGCTAA
|
Protein sequence | MLRYLLPLVA LIPATASAET LYVRAGRLVD PEAGKVLTGQ VLTVEDGRVV SVQGDGPLPQ GAKVVDWSGF TVLPGLIDCH VHLADVEQSN NVAEPLLHSA MEIGFIGARN AKKTLLAGFT TVHDVGSFRA YADVELRNAI NRGDVPGPRV SAVGAYVTIP GGGGEVTGFA PDVTVPADMR AGVVNDAADV TRKVNALFQN GADSIKLIAT GAVLAQGTEP GQIELSGEMM KAAVDVARQR GSWVTAHAHG AAGIKLAIQS GVKAIEHASL IDDEGIALAR ARGVWLDMDI YNGDFIAEVG KRDGWPADML RKNDETTEAQ REGFRKAVKA GVRLSYGTDA GVFPHGLNAR QFRYMVRYGM TPMQAIQSAT TVAADLLGWS RDVGALSPGH YADMIAVAGD PLADVSVLEN VAHVMKGGEV VR
|
| |