Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0680 |
Symbol | purK |
ID | 5711516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 686761 |
End bp | 687834 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641266589 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001532027 |
Protein GI | 159043233 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.616894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAGC CGCTGGCCCC CGGGGCGGTG ATCGGGATGC TGGGCGGCGG GCAGTTGGGG CGGATGCTGT CGATGGCCGC GGCGCGGCTG GGGTTTCGGT GCCATATCTT CGAGCCGGGG GCCGCCCCGC CTGCGGGCCA GGTGGCCGAG GCGGTGACCA CCGCCGGCTA TGACGATCTG GACGCGCTGC GACGCTTCGC CGAGGTCGTG GACGTGATCA CCTACGAGTT CGAGAACATC CCCACCGCCG CGCTCGACGT GCTGGAGGCG CTGCGCCCGA TCCATCCCGG CCGACGCGCC CTGGCCGTGA GCCAGGACCG GCTGACGGAG AAGGAATTCC TGCGCGGGCT CGGTCTGCAA ACCGCGCCCT TCGCGCCCGT CGACGACGCC GCGGGGCTGG AGGCCGCCCT TGCCGCCATC GGCACGCCCG CGATCCTCAA GACCCGGCGA CTGGGCTATG ACGGCAAGGG CCAGACGCGG CTGACTGCGC CCGGCGACGC GGCGGAGGCG CTCGCGGCCA TGGCCGGGGC GCCCGCGATC CTGGAAGGGT TCGTGGAGTT CTCCCACGAG GTGTCGGTGA TCGCGGCGCG CGGGCAGGAC GGGGCCGTGG CCTGTTTCGA TCCCGGCGAG AACGTGCACC GGGACGGCAT TCTGGCCACG ACCACGGTGC CCGCGCGGCT CAGCCATGCC CAGCGCACGG ATGCAGTGCT GCTGGCGGGG CGGATCCTGA ACGCGCTCGA TTATGTCGGG GTGATGGGGG TGGAGCTGTT CGTGACCCGC GGCGGGCTGA TCGTGAACGA GATCGCGCCG CGGGTGCACA ATTCCGGCCA CTGGACCCAG ACCGGCTGCG TCATAGACCA GTTCGAGCAG CATATCCGCG CCGTGGCGGG CTGGCCCCTC GGCGACGGGC AGCGCCATGC GGATGTGGTG ATGGAGAACC TCATCGGGTC GGACATGGAC CGGGTGCCGG AGCTGGCCGC GGCGCGCGAC GTGGCCCTGC ATCTCTATGG CAAGGCCGAG ACCAGGGCGG GCCGCAAGAT GGGCCACGCC AACCGGATCA TGCGCCCGGG CTGA
|
Protein sequence | MTEPLAPGAV IGMLGGGQLG RMLSMAAARL GFRCHIFEPG AAPPAGQVAE AVTTAGYDDL DALRRFAEVV DVITYEFENI PTAALDVLEA LRPIHPGRRA LAVSQDRLTE KEFLRGLGLQ TAPFAPVDDA AGLEAALAAI GTPAILKTRR LGYDGKGQTR LTAPGDAAEA LAAMAGAPAI LEGFVEFSHE VSVIAARGQD GAVACFDPGE NVHRDGILAT TTVPARLSHA QRTDAVLLAG RILNALDYVG VMGVELFVTR GGLIVNEIAP RVHNSGHWTQ TGCVIDQFEQ HIRAVAGWPL GDGQRHADVV MENLIGSDMD RVPELAAARD VALHLYGKAE TRAGRKMGHA NRIMRPG
|
| |