Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1889 |
Symbol | purM |
ID | 5712881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1966227 |
End bp | 1967288 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641267813 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001533232 |
Protein GI | 159044438 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0381482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.145753 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCG ACACACCGCC GCCGAAACCC GGTCTGACAT ATGCCGAGGC CGGCGTCGAC ATCGATGCGG GCAACACCCT GGTGGACCGG ATCAAGCCCG CCGCCAAGGC CACCTCTCGC CCGGGCGTGA TGAGCGGTCT GGGCGGTTTC GGCGCACTGT TCGACCTTAG GGCCGCGGGC TACGCCGACC CGGTGCTGGT GGCCGCCACG GACGGGGTCG GCACCAAGCT GCGGATCGCC ATCGACACCG GCCATGTCGA CACGATCGGG ATCGACCTGG TAGCGATGTG CGTCAACGAC CTCGTGTGCC AGGGGGCTGA ACCGCTGCTT TTCCTGGACT ATTTCGCCAC CGGAAAGCTC GACGTGGCCG AGGCCGCGAC GATCGTCGAG GGTATCGCCC GGGGCTGCGC CACTTCCGGC TGCGCGCTGA TCGGCGGCGA AACCGCCGAG ATGCCGGGCA TGTATGCCAA GGGCGATTTC GACCTCGCGG GCTTTGCCGT CGGCGCGATG GAGCGGGGCG GCGCGTTGCC CGCGAATGTG GCGGCAGGGG ACATGATCCT CGGGCTGGCC TCGGACGGGG TCCATTCCAA CGGCTACTCG CTGGTGCGTC GGATCGTCGA GCGCTCCGGT CTGGGCTGGG GCGATCCCGC ACCGTTCGAG GGCCGGACTC TCGGCGCGGC CCTGCTGACG CCCACGCGGC TCTACGTGCA ACCGGCGCTG GCGGCGATCC GCGCGGGCGG GGTGCACGGG CTGGCCCATG TCACCGGCGG CGGGCTGACC GAGAACCTGC CCCGGGTGCT GCCCGAGGGG CTGGGGATCG AGATCAACCT CGGCGCGTGG GAATTGCCGC CGGTGTTCCG CTGGCTCGCC GCCGAGGGCG GGCTCGACGA GGCCGAACTG CTCAAGACCT TCAACGCCGG GATCGGCATG GCCCTGATCG TGGCGCCCGA CCGGGCCGAG GCGCTCGCGG ACCTGCTGGC CGGGGCGGGC GAGCGTGTGG CGGTGATCGG CCATGTCACC GAAGGCGCGG GCGCCGTGCA CTATCGCGGG ACGCTTCTTT GA
|
Protein sequence | MTTDTPPPKP GLTYAEAGVD IDAGNTLVDR IKPAAKATSR PGVMSGLGGF GALFDLRAAG YADPVLVAAT DGVGTKLRIA IDTGHVDTIG IDLVAMCVND LVCQGAEPLL FLDYFATGKL DVAEAATIVE GIARGCATSG CALIGGETAE MPGMYAKGDF DLAGFAVGAM ERGGALPANV AAGDMILGLA SDGVHSNGYS LVRRIVERSG LGWGDPAPFE GRTLGAALLT PTRLYVQPAL AAIRAGGVHG LAHVTGGGLT ENLPRVLPEG LGIEINLGAW ELPPVFRWLA AEGGLDEAEL LKTFNAGIGM ALIVAPDRAE ALADLLAGAG ERVAVIGHVT EGAGAVHYRG TLL
|
| |