Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2932 |
Symbol | |
ID | 3917367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3147585 |
End bp | 3148733 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640445710 |
Product | peptidase M20D, amidohydrolase |
Protein accession | YP_498201 |
Protein GI | 87200944 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.886758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAA TCGCAGATCT CCTTCCGGAC CTCGTGGCCC TGCGCCGCGA CATCCATCAG AACCCCGAAC TCGGCTTCTG CGAACACCGC ACCGCCGCGC GCATCGCCGC AGAACTGCGG GCCAGCGGGA TCGAGGTCCA CGAAGGCATC GGCACCACCG GGCTCGTCGG CGTGCTCAAG GGCACGCGCG AGGGCACTCG CAGCCTCGGT CTGCGCGCCG ACATGGATGC CCTGCCGATC CACGAGCAGA CCAATCTTCC CTGGGCAAGC CGTACCCCCG GCACCTTCCA CGGCTGCGGC CACGACGGCC ACGTCGCGAT GCTGCTCGGC GCCGCCCGCG TGCTGGCTGC CGACCCCGAC TTCGCGGGTA CGGTCAACTT CATCTTCCAG CCCGCCGAGG AGGGGCAGGG CGGGGCGCGC GTGATGATCG AGGAAGGCCT GTTCGACCGC TTCCCCTGCG ACCGCGTCTA TGCGCTGCAC AACTGGCCGG GCCTGCCCGC CGGCACGATC AGCACCCGTC CCGGCGCGAT CATGGGCGCG GCCGACAAGT TCCGCATCGC GCTTGTCGGC AAGGGCGGCC ACGCCGCGAT TCCGCAGGAC AGCCCCGATG CGATCCTTGC CGCCGCCAGC CTTGTCCAGC AACTCAACAC CATCGTCAGC CGTGCCGTGC CGCCCTATGC GGCGGCCGTC CTCTCGATCA CCGAGATCCA CGGTGGCCAC GCCCACAACG TCATCCCGGC GGAAGTCATG GTGGGGGGCA CGGTCCGCAC GTTCGACCCG GCGGTCCAGG ATCGCATCGA GGAACGGATG CGCGCGATGC TGAAAGGCAT CGAGGCTTCG TTCGAGGTCG AATGCGCGCT CGACTACGAC CGTTACTATC CCGCCACGAT CAACGATCCG CAGGCGGCAG CCGATGCGCT GGAAGTGGCC GCGACGGTGG CACGGGCGGA ACTCGCGCCT GAACCCGCGC CGACCTCGGA AGACTTTTCC TTCATGCTCC AGCAGCGCCC CGGCGCCTAC ATCTGGCTGG GGCAGGGCAC GCAGGGCCAC GCCGCGCCGC TCCACAATCC ACACTACGAT TTCAACGATG GCGTCATGGA AACGGGCATC CGTCTCCACG TTGCGCTTGC CCGGCACTGG CTGGGCTGA
|
Protein sequence | MSEIADLLPD LVALRRDIHQ NPELGFCEHR TAARIAAELR ASGIEVHEGI GTTGLVGVLK GTREGTRSLG LRADMDALPI HEQTNLPWAS RTPGTFHGCG HDGHVAMLLG AARVLAADPD FAGTVNFIFQ PAEEGQGGAR VMIEEGLFDR FPCDRVYALH NWPGLPAGTI STRPGAIMGA ADKFRIALVG KGGHAAIPQD SPDAILAAAS LVQQLNTIVS RAVPPYAAAV LSITEIHGGH AHNVIPAEVM VGGTVRTFDP AVQDRIEERM RAMLKGIEAS FEVECALDYD RYYPATINDP QAAADALEVA ATVARAELAP EPAPTSEDFS FMLQQRPGAY IWLGQGTQGH AAPLHNPHYD FNDGVMETGI RLHVALARHW LG
|
| |