Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2175 |
Symbol | |
ID | 5054791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1944637 |
End bp | 1945725 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640469727 |
Product | phosphoribosylaminoimidazole carboxylase |
Protein accession | YP_001154373 |
Protein GI | 145592371 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.470665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCTCT ACGTCCTCGG CGGGGGTCAG CTGGCGCTTA TGATGTGCTG GGAGTCGCAG AGGGTCCCGG TACACTTCTC GGTGTACGAC CCCGACCCCT CCGCCCCGGC CTACAGATGC GCGAAGAGGC CCTCAGACCC GCTTGAGGAG GTGGACAAGG CCGATGCCGT CACCTTCGAA TTTGAGAACG TGGACATCTC GCTGGCGGAG CGGGCTGAGA AGCTGGGCAA GCTTAGGCCC CCGCTGGCCT ACCTGAAAGT TAAGAAGAGC AGGATCGAGG AGAGAGCCCT CATGGACTCT ATCGGCGTGC CCACTGTGCC GTGGCGCCGG GCGGCTAATT GGGAGGAGGC TATTAAAATA GCGGAGTCCA TGGGTAGGGC ATATGTAAAG GTGCCCACCG GCGGCTACGA CGGGAAGGGG CAGTACATAT ACCCCCACGA GGCCGCGTTG ATAAGGGGGC TGGCTGGGGA GCTCCTGGTA GAGGAGTACG TAGACATCAG GAGGGAGTTC TCCATTGTGG CCGCGAGGGC GGAGAACGGC GATGTCTACT TCTACCCACC TGCCGAGAAC TTCTACGTAC ACGGCATCTT GGTCTGGAAC TACGCTCCGA CAAAAGTGCC GGAAGAGGCG TATGAATACG TCAACCGCAT ATTGGAGTGG GGGAAGTACG TTGGGGTCAT CGCAGTGGAG TTCTTCGAGG CTAGGGACGG CCGCGTTTTG GTCAACGAGA TAGCCCCCAG GGTCCACAAC ACGGGGCACT GGACCCTGGA GACAGACGCC AGCCAGTTCG AGAACCACGT CCGCGCCGTC TTAGGCCTCC CGCTGAGGAG ACCCCGCGCT ATGGCCCCTA CGGCGATGGT AAACATCTTG GGCGTCGGCC TGGGAAAGCT ACCGCTGGCG GAGCTGGAGC GGCGGGGGCG GGTGTACTGG TACTACAAGG CTGAGGCTAG GCCGAGGCGG AAAATGGGCC ACCTCAACAT AACGGCTGGG TCAGTGGAGG AGGCGATCAC AAAGGCGAGG GAGGCGCTGA GGCTGATATA TGGAGCGGAT TTCCCAAGGC TTGTGATGAG GTCGAGGCCT AGCCCTTGA
|
Protein sequence | MRLYVLGGGQ LALMMCWESQ RVPVHFSVYD PDPSAPAYRC AKRPSDPLEE VDKADAVTFE FENVDISLAE RAEKLGKLRP PLAYLKVKKS RIEERALMDS IGVPTVPWRR AANWEEAIKI AESMGRAYVK VPTGGYDGKG QYIYPHEAAL IRGLAGELLV EEYVDIRREF SIVAARAENG DVYFYPPAEN FYVHGILVWN YAPTKVPEEA YEYVNRILEW GKYVGVIAVE FFEARDGRVL VNEIAPRVHN TGHWTLETDA SQFENHVRAV LGLPLRRPRA MAPTAMVNIL GVGLGKLPLA ELERRGRVYW YYKAEARPRR KMGHLNITAG SVEEAITKAR EALRLIYGAD FPRLVMRSRP SP
|
| |