Gene Information |
Locus tag | NSE_0965 |
Symbol | purK |
ID | 3931919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | + |
Start bp | 851895 |
End bp | 852944 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637901119 |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | YP_506829 |
Protein GI | 88607982 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
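The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in Protein Length; the function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(851895, 852944)
print(length_bp)                  # 1050
print(protein_length(length_bp))  # 349
```

Under these assumptions the results agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields.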
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.676938 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCTTT ATAGTAGGAA AATGCAGCGC ATTGGAATAA TCGGAGGCGG ACAGCTAGGA AAAATGACAG CCATAGCTGC ATATAACCTG GGTTTTAAAG TTTGTGTCCT AGCAGAGAAG GAGAACTCCC CAGCTATAGA TGTAGCCAGA GATTATGTTG TTTCACCTTT CCTCGACAGG TCAGGGATTC TTAACTTTGT AGAACATGTA GACGCTATAA CATTTGAGAG CGAAAACATA CCAACAGAGA CGCTTGATCT ACTCCACGAT AAGTTCGACG TTCCAAATAC TAAAGCGATT AAGGTAGCAC AGGACAGGTT CTTAGAAAAA GAATTCTTAA GGAAAAATGG AATACCAACT ACCGAATATT GGTACATCGA AAAGGAGGAG GACCTTGATA GTGTAGATTT TCCAGCAATA TTAAAAACAA TCAATGGTGG CTATGATGGA AAAGGACAAT TCCTCCTAGA GGATCATGAT TATGTAAGAA GAGAAGCCGG AAACCTCAAA TTCCCTCTAA TAGCAGAAAA GTTATTTAGG ATAAGTAAAG AATTCTCCAT AATAGTTTCG AGAAATGAAA CTGGAAGTGT GTGTTTTCCG ATAGCAGAAA ATGTTCATGT TAATGGAATA CTCAAAACAT CCAGCGTGCC AGCTGTACTT CCTCATCATG TCGCACTTGA AATAAAAAAC ATAGGATTTC AAATAGCAGA TCTATTAGAA ATAAAGGGTC TCTTGTGTGT AGAATTTTTT CTCGACGAAG ATAACAAGCT AGTTGTGAAT GAAATCGCTC CTAGACCACA CAATTCTGGT CACTGGAGCA TGGATTGCTG CGACATTGAC CAATTTGAAG AACTGGTTCT TGCAATTACA GGTAATAAAC TCAAAAAGCC TAATCTCGTA GTTCCGTGCA CAATGAAGAA TATTCTTGGC AATGAAATAA ATACTTGGAA AGATTTATTC CTACAAAAAA ATGTAAAACT CTACAACTAT GGTAAAGAAC AGCCTAAAAT CCTAAGGAAA ATGGGGCATA TAAACATTCT GCATCCGTAA
|
Protein sequence | MDLYSRKMQR IGIIGGGQLG KMTAIAAYNL GFKVCVLAEK ENSPAIDVAR DYVVSPFLDR SGILNFVEHV DAITFESENI PTETLDLLHD KFDVPNTKAI KVAQDRFLEK EFLRKNGIPT TEYWYIEKEE DLDSVDFPAI LKTINGGYDG KGQFLLEDHD YVRREAGNLK FPLIAEKLFR ISKEFSIIVS RNETGSVCFP IAENVHVNGI LKTSSVPAVL PHHVALEIKN IGFQIADLLE IKGLLCVEFF LDEDNKLVVN EIAPRPHNSG HWSMDCCDID QFEELVLAIT GNKLKKPNLV VPCTMKNILG NEINTWKDLF LQKNVKLYNY GKEQPKILRK MGHINILHP
|
| |
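The sequences above are printed in space-separated blocks of 10 residues. As a rough sketch of how the Gene sequence relates to the Protein sequence and GC content fields, the code below joins the blocks, computes the G+C fraction, and translates codons up to the first stop. It assumes the codon assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which this sketch does not handle; the gene here begins with ATG). The helper names `join_blocks`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between 10-residue blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
gene_prefix = join_blocks("ATGGACCTTT ATAGTAGGAA AATGCAGCGC")
print(translate(gene_prefix))  # MDLYSRKMQR
```

The printed peptide matches the first block of the Protein sequence field. Passing the full Gene sequence to `gc_content` gives a value that can be compared with the reported GC content field.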