Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2898 |
Symbol | |
ID | 4027906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3228734 |
End bp | 3229828 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637968106 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_574943 |
Protein GI | 92115015 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.52594 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATATCG GTATCGTGGG CGGCGGCCAA TTGGGCCGGA TGCTGGCCCA AGCGGGGGCG CCGCTGGACA TGCGCTTCAC CTTCCTCGAT CCCTCGACCC AGGCGTGCGC CGCCTCGCAG GGCGTGCACC TCTGCGCCGA TTGGGACGAC GAAGAAGCGC ACCAGGCACT CATCGAAGCC AGCGACGTGA TCACCTTCGA GTTCGAGAAC GTCTCGACCA TGGCGCTGAC GCAGCTCGCC GAGCAGCGCC CGGCCTTCCC GCCGGCCAAG GCACTGGAAA CCGCGCGCGA CCGTGGCGCC GAGAAATCGC TGTTCCAGTC GCTGGACATC GCCATTGCCC CCATCGCGCT GATCGAGAGC CAGGAAGATC TGGACCGCGC GGTGGCCGAG ATCGGCCTGC CGGCGGTGCT CAAGACGCGC ACCCTGGGCT ACGACGGCAA GGGCCAGAAG GTGCTGCGCA GCGCCGACTA CGTGCCCGGC AGCGTCGCCG AGCTGGGTGA CGTGCCCTTG ATCCTGGAAG GCTTCATCGA CTTCGATCAT GAAGTCTCGG CCATCGCCGT GCGCGGCCGC GACGGCGAGG TGCGGGTGTG GCCGCTGTCA CGCAACGAGC ACCGCCAGGG CATCCTGCAT CGTGCCGAGC CGCAGCCCGA CCACCCGCTG TACGCCCGCG CCGCCGACTA CGCCACCCGC GTGCTCGACG CGCTGGAGTA CGTCGGGGTC ATGGCCTTCG AGTTCTTCGT CACCCGCGAC GGCGAGCTGC TCGCCAACGA AATCGCCCCC CGCGTGCACA ACTCCGGGCA CTGGAGCATC GAAGGCGCGA CGACCAGCCA GTTCGAGAAT CACCTGCGCG CCGTCGCCGG GCTGCCGCTG GGCGACACGA CACGCCTCGT GCCCTGCGCC ATGCTCAACA TCATCGGCGC CTTCCCCGAC CGCGACGCCG TGCTCAGTGT CGCCGGCGCA CGCCTGCACG ACTACGACAA GGCCCCGCGC CCCGGCCGCA AGATCGGCCA TGTCACGGTT CTCGCACCCG ACGAGGCCAC CCTGGCCGAG CGCGTCGCCG CCGTCGAGGC CCTGCTGGTC AACGACCTGG GCTGA
|
Protein sequence | MHIGIVGGGQ LGRMLAQAGA PLDMRFTFLD PSTQACAASQ GVHLCADWDD EEAHQALIEA SDVITFEFEN VSTMALTQLA EQRPAFPPAK ALETARDRGA EKSLFQSLDI AIAPIALIES QEDLDRAVAE IGLPAVLKTR TLGYDGKGQK VLRSADYVPG SVAELGDVPL ILEGFIDFDH EVSAIAVRGR DGEVRVWPLS RNEHRQGILH RAEPQPDHPL YARAADYATR VLDALEYVGV MAFEFFVTRD GELLANEIAP RVHNSGHWSI EGATTSQFEN HLRAVAGLPL GDTTRLVPCA MLNIIGAFPD RDAVLSVAGA RLHDYDKAPR PGRKIGHVTV LAPDEATLAE RVAAVEALLV NDLG
|
| |