Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SO_3555 |
Symbol | purK |
ID | 1171226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella oneidensis MR-1 |
Kingdom | Bacteria |
Replicon accession | NC_004347 |
Strand | + |
Start bp | 3712931 |
End bp | 3714028 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637345354 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | NP_719102 |
Protein GI | 24375059 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAA CAAGCACAAA ACCTAAGGTT TGGGTATTAG GAAATGGCCA ACTCGGTGCC ATGCTAACCC ATGCAGGTGA GCCGCTGGCG ATTGATGTTC GCGCCGTGGA TATAATGACG CCAACGGATG ATATTTTACC CCTCGCGCCA AACGATATTA TTACCGCCGA ACGTGAGCAG TGGCCTGAAT CCGCCTTAAG CTTACAACTC AGCACCCATC CGCATTTTGT TAACGGCCCA GTCTTTAGCC GTTTAGCCGA TCGCTATAGC CAAAAAAGCT TACTCGATCA GCTTAACGTC CCAACAGCAC CTTGGTCACT CGTTGATGAT CACACAAAGG TCGAAACTCT GTATCAAGCG TTTGGCCCAA GAGTTTTAAT GAAGCGCCGC ACTGGCGGTT ATGATGGCAA AGGTCAGCAT TGGCTAAAAC AAGCCGAAGC GGGTGATATC CCCCATGATT GGCGCAACTT AGCCATTGCC GAGCAAGCGA TAAACTTCGA TGAAGAAGTG TCCTTAGTCG GCGTGCGTAC CCGTGAAGGC CAATGCGTGT TTTATCCATT AACACTTAAC CTTCATCAAG ATGGCATTTT GATGGCATCG ATTGCACCAC TGGCACGCTT AGATCATCTG CAAGCCCAAG CCGAAACCAT GCTCAGTGCG ATTATGCATG AGCTGGAATA TGCCGGCGTG ATGGCGATGG AATGCTTCCG TGTTGGCGAC AATCTGCTGG TCAACGAGTT AGCCCCGCGG GTACACAACT CCGGCCATTG GACCCAAGCG GGCACCCATA TGGATCAATT TCAACTGCAT TTAAGAGCCT TGTGTGGGAT TGCGATTCCA CAGCCACAGG TGAACTTTCA ATGTGTGATG GTTAATCTGA TTGGTATCGA CAACGATCCC CGTTGGTTAA GCTTGCCCAA TGCAGAACTC TATTGGTACA ACAAAGAAGT ACGTCCTGGC CGCAAAGTCG GGCATTTAAA TCTTTCGGTG CCTAATCTGA CTGTATTAAC AAACAGCATT AGTGCACTAC AAACGTGGAT GCCAAACCAA TATCAAGCAC CTCTCGCTTG GATTTTGGCT GAGTTTACAA AAAGCTAA
|
Protein sequence | MTQTSTKPKV WVLGNGQLGA MLTHAGEPLA IDVRAVDIMT PTDDILPLAP NDIITAEREQ WPESALSLQL STHPHFVNGP VFSRLADRYS QKSLLDQLNV PTAPWSLVDD HTKVETLYQA FGPRVLMKRR TGGYDGKGQH WLKQAEAGDI PHDWRNLAIA EQAINFDEEV SLVGVRTREG QCVFYPLTLN LHQDGILMAS IAPLARLDHL QAQAETMLSA IMHELEYAGV MAMECFRVGD NLLVNELAPR VHNSGHWTQA GTHMDQFQLH LRALCGIAIP QPQVNFQCVM VNLIGIDNDP RWLSLPNAEL YWYNKEVRPG RKVGHLNLSV PNLTVLTNSI SALQTWMPNQ YQAPLAWILA EFTKS
|
| |