Gene Information |
Locus tag | Saro_0899 |
Symbol | |
ID | 3917985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 955550 |
End bp | 956656 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640443633 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_496178 |
Protein GI | 87198921 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
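
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record above.
length = gene_length(955550, 956656)
print(length)                  # 1107
print(protein_length(length))  # 368
```

Passing the gene sequence from the Sequence section to `gc_fraction` gives a value that can be compared with the listed GC content; the function strips the spaces between the 10-base blocks before counting.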
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.514868 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACG AGCAGAACAA GCCAGAGAGC TACAGCTACG CCAAAGCCGG CGTGAACATT GCTGCGGGCA ACGCATTGGT GAAGGCCATA GGCCCCCTTG CGAAGTCCAC CGCCCGCCCC GGCGCGGACG CGGAACTCGG CGGCTTTGGC GGCTTCTTCG ATCTCAAGGC CGCTGGCTAC AACGACCCTC TGCTGGTCGC CGGCAACGAC GGCGTGGGCA CCAAGGTCAA GCTCGCCATC GACCATGACC GGCACGACCA GATCGGCATC GATCTTGTCG CGATGTGCGT CAACGACCTC ATCGTCCAGG GCGCGGAGCC CCTGTTCTTC CTCGACTATT TCGCCACCGG ACGTCTCGAT AACGGCGTTG CCGAACGTGT CGTCGCGGGC ATCGCCGATG GCTGCAAGCT GGCCGGTTGC GCCCTCATCG GCGGCGAGAC CGCCGAAATG CCCGGCATGT ATGCCGACGG CGACTACGAC CTTGCCGGCT TCTGCGTCGG TGCGGTCGAA CGGGGCGAGC AACTGACCGG AGACCGCGTG GCCGAAGGCG ACGTGCTGCT CGGCCTCGCT TCCTCGGGCG TCCATTCCAA CGGCTATTCC CTCGTCCGCC GCCTTGCCGC CGACAAGGGC TGGAAGCTCG ATCGCCCGGC CCTGTTCGAC AACGAGCGCC TGTTGATCGA TTACCTGATC GAACCGACCC GCATCTATGT GAAGAGCCTG CTGCCCTTCA TCCGCAGCGG CCGGATCAAC GCGCTGGCCC ACATCACCGG CGGCGGCCTG CTTGAGAACG TCCCGCGCGT CCTGCCGCGT GGCCTCCACG CCCGGATCGA CGCCGACAGC TGGGAACAAA GCCGCCTGAT GGCCTTCCTC CAGGCCCAGG GCAACATCGA GCCCGAGGAA ATGGCCCGCA CCTTCAACTG CGGCATCGGC ATGATCCTCG CGGTCAATCC GGCACAGGCG GATGCGCTTG CCGCAGACCT CGCCGCCGCC GGCGAGACGG TCTACCGCGT CGGCACCATC GTGAAGGGCG AAAAGGGCTG CACCGTCACC GGAAGCGCCG AGACCTGGTC CGCCCGCTCG GCCTGGGAAG CCACCCACAT TGGCTGA
|
Protein sequence | MSDEQNKPES YSYAKAGVNI AAGNALVKAI GPLAKSTARP GADAELGGFG GFFDLKAAGY NDPLLVAGND GVGTKVKLAI DHDRHDQIGI DLVAMCVNDL IVQGAEPLFF LDYFATGRLD NGVAERVVAG IADGCKLAGC ALIGGETAEM PGMYADGDYD LAGFCVGAVE RGEQLTGDRV AEGDVLLGLA SSGVHSNGYS LVRRLAADKG WKLDRPALFD NERLLIDYLI EPTRIYVKSL LPFIRSGRIN ALAHITGGGL LENVPRVLPR GLHARIDADS WEQSRLMAFL QAQGNIEPEE MARTFNCGIG MILAVNPAQA DALAADLAAA GETVYRVGTI VKGEKGCTVT GSAETWSARS AWEATHIG