Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0840 |
Symbol | |
ID | 8332170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 974707 |
End bp | 975843 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644953990 |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | YP_003111614 |
Protein GI | 256390050 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0378417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000000944832 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGTAGGAG TCGTCGGCGG CGGCCAGCTG GCGCGCATGA TGCAGCAGGC CGCGATCGGT CTCGGGGTCG AGCTGCGGGT GCTCGCGCAG CGGCCCGACG ATCCGGCGGC GCGGGTGACC CCGGGCACCG TGATCGGGGA CCACCACGAC TTCGAAGCGT TGAAGGCCTT CGCCGCCGGC TGCGACGTGC TGACCTTCGA CCACGAGCAC GTCCCCACCG ACTTCCTGCA CGAGCTGGAG GCCGCCGGCG TCGCCGTGCG CCCCGGCCCG GACGCGCTCG TCTACGCGCA GGACAAGGGT CTGATGCGCC AGCGGCTCTC CGCCCTCGGG CTGCCCTGTC CGCAGTGGGC ACTGATCTCC TCTGCCGACG ACCTCGCCGA CTTCGGCGCC ACGGTCGGTT TCCCCTTTGT CCTCAAGGCG ACGCGCGGCG GGTACGACGG CCGCGGCGTC TGGGTCGTCG ACGACCTGGA CGCGGCGAAG GCCGTGCTGG ACGGCGCCGC CGAGCGCGGG GTGGCGCTGC TGGCCGAGGC CAAGGTGCCC TTCGTCCGCG AGCTGTCGGC GCAGGTCGCC CGCTCCCCGC ACGGCCAGGC CGCGGCCTAC CCGGTGGTCG AGTCGCTGCA GATCGACGGC ATCTGCCGCG AGGTTTACGT GCCGGCGCCC GGGCTGTCCG AGGTCGCCGC GGTCGAGGCG CAGCGGATCG CGCTGACGAT CGCCAAGGAG CTGGGCGTCA CCGGCATGCT CGCGGTGGAG ATGTTCGAGA CCGCCGACGG CTCGGTGCTG ATCAACGAGC TGGCGATGCG GCCGCACAAC TCCGGGCACT GGAGCATGGA CGGCGCGGTC ACCGGCCAGT TCGAGCAGCA TCTGCGCGCC GTGCTGGACC TGCCGCTGGG GCAGGTGAAG CCGGTCGCGC CGGTGGTCGT CATGGCGAAC GTGCTGGGTC TGGACCTGCC GGAGGTCTAC CCGGCGTACA AGCACGTGAT GGCGCACGAC CCGGGGGTCA AGGTGCACAT GTACGGCAAG GACGTGAAGC CGGGCCGCAA GATCGGCCAT GTCAACGTCC TGGGTACGGT GTTCGACGAC GTTGCCGACC GGGCCCGCCA CGCCGCGGCC TATCTGCGAG GAGAGATCGA TGAGTGA
|
Protein sequence | MVGVVGGGQL ARMMQQAAIG LGVELRVLAQ RPDDPAARVT PGTVIGDHHD FEALKAFAAG CDVLTFDHEH VPTDFLHELE AAGVAVRPGP DALVYAQDKG LMRQRLSALG LPCPQWALIS SADDLADFGA TVGFPFVLKA TRGGYDGRGV WVVDDLDAAK AVLDGAAERG VALLAEAKVP FVRELSAQVA RSPHGQAAAY PVVESLQIDG ICREVYVPAP GLSEVAAVEA QRIALTIAKE LGVTGMLAVE MFETADGSVL INELAMRPHN SGHWSMDGAV TGQFEQHLRA VLDLPLGQVK PVAPVVVMAN VLGLDLPEVY PAYKHVMAHD PGVKVHMYGK DVKPGRKIGH VNVLGTVFDD VADRARHAAA YLRGEIDE
|
| |