Gene Information |
Locus tag | Sare_0872 |
Symbol | |
ID | 5704537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 976789 |
End bp | 977970 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270391 |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | YP_001535781 |
Protein GI | 159036528 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
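The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(976789, 977970)
print(length, protein_length_aa(length))  # 1182 393
```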
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.803514 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0297907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTCCC ACACCGGTCT GCCCCTTGTC GGCATGGTGG GCGGCGGTCA ACTGGCCCGG ATGACCCATC AGGCCGCGAT CGCCCTCGGC CAGTCGCTGC GGGTGCTCGC GCTCGCTCCC GACGACAGTG CTGCCCTGGT GGCCGCCGAC GTGCAGTACG GCGACCACAC CGACCTGGCG GCACTGCGCA CCTTTGCCAA GGGCTGTGAC GTGGTCACCT TCGACCATGA GCACGTTCCC ACCGAGCACA TCGACGCCCT CGCCGACGAA GGCGTCAAGC TGTTCCCGCC GGCCGAGGCG CTGGTGCACG CACAGGACAA GCAGGTCATG CGGGAACGTC TCGCCGGGTT GGGCATGCCG AACCCGGCCT GGCGGCCGGT CGACACTCCG GCTGACGTCG AGTCCTTCGG TGACGCGGTG GGCTGGCCGG TGGTGCTCAA GGCGGCCCGG GGTGGCTACG ACGGCCGGGG CGTGTGGCTG GTGGACGACG CCGCCGGGGC GGTTGAGCGA ACGGCCACGC TGCTGGCCGC AGGGACGCGC CTCATCGTCG AGGAGCGGGT GGCGCTGCGC CGGGAACTGG CCGTGCAGGT GGCCCGTTCA CCGTTCGGGC AGGTCGCCGT GTATCCGGTG GTCGAGACCG TGCAACGGGA CGGCGTCTGC GTCGAGGTCC TGGCCCCCGC ACCAGACCTG CCGGAGGAGT TGGCGGTCGG TGCGCAACAG CTCGCTATCG ATCTGGCCAC CGCGCTCGGC GTGGTGGGGC TGCTCGCCGT CGAGTTGTTC GAGGTGGCCG ACCCGGCCGA GGTGACGGGC AGTCGGCTCG TGGTCAACGA GTTGGCGATG CGTCCGCACA ACTCCGGGCA CTGGACGATC GAGGGCGCCC GGACGTCGCA GTTCGAGCAG CACCTACGGG CGGTGCTTGA CTATCCGATG GGGGACACCT CCCTGGCCGC GCCGATCGTG GTGATGGCGA ACGTGCTGGG CGGCGAGCCG GGAGGTATGT CCTTCGACGA GCGCCTGCAC CACCTGTTCG CTGCCGAGCC GGGCGCGCAG GTGCACCTGT ACGGCAAGCA GGTGCGCCCA GGTCGCAAGA TCGGCCATGT CACGGTGCTC GGCGACGACC TGGACGAGGT ACGTACCCGG GCGGCGCGCG CGGCCCGTTG GCTGCGGGAG GGGCGCGGAT GA
|
Protein sequence | MDSHTGLPLV GMVGGGQLAR MTHQAAIALG QSLRVLALAP DDSAALVAAD VQYGDHTDLA ALRTFAKGCD VVTFDHEHVP TEHIDALADE GVKLFPPAEA LVHAQDKQVM RERLAGLGMP NPAWRPVDTP ADVESFGDAV GWPVVLKAAR GGYDGRGVWL VDDAAGAVER TATLLAAGTR LIVEERVALR RELAVQVARS PFGQVAVYPV VETVQRDGVC VEVLAPAPDL PEELAVGAQQ LAIDLATALG VVGLLAVELF EVADPAEVTG SRLVVNELAM RPHNSGHWTI EGARTSQFEQ HLRAVLDYPM GDTSLAAPIV VMANVLGGEP GGMSFDERLH HLFAAEPGAQ VHLYGKQVRP GRKIGHVTVL GDDLDEVRTR AARAARWLRE GRG
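The protein sequence can be related to the gene sequence by translating it with translation table 11, as listed in the record. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string (table 11 uses the same codon assignments as the standard code and differs only in permitted start codons), strips the 10-base spacing used above, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# first base varying slowest and the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G are reported as 'X'.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGACTCCC ACACCGGTCT G"))  # MDSHTGL
print(translate("GGGCGCGGAT GA"))            # GRG
```

The two printed fragments correspond to the first seven and last three residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` gives a way to compare against the Protein Length and GC content fields.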
|
| |