Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1189 |
Symbol | |
ID | 4446320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1289709 |
End bp | 1290950 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639688996 |
Product | phosphoribosylaminoimidazole carboxylase |
Protein accession | YP_830683 |
Protein GI | 116669750 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGAAGC CCGGAACACC CCTGCTCTTC TGTAGGCTGG CTCTTGTGAC TTTTCCAGTA ATAGGCGTAG TTGGCGGCGG CCAGCTAGCC CGCATGATGG CCCCCGCCGC AACGGCCCTG GGCTTTGAAC TCCGTGTCCT GGCCGAAGGC GAGGACGTTT CCGCGGTTTC CGCAGTGCCG ACGTCGCCGG TGGGCGACTA CAAGGACCTT GACGCCCTCC TCGAGTTCTC CCGGGGGCTG GACGTCATGA CCTTTGACCA CGAGCACGTC CCCAACGACC ACCTGCGGGC ACTGCAGGAG GCCGGCGTCA ACGTCCAGCC CGGCCCGGAC GCCCTGGTCC ACGCGCAGGA CAAGCTGGTG ATGCGGGCAG CCATCGACCG GCTTGAGCTG CCCAACCCGG CCTGGGCCTC CGTTGCCGAC GTCGAGGCCC TGGTTGCCTT CGGCGAGAAG ACCGGGTGGC CGGTGGTGTT GAAGACGCCC CGCGGCGGTT ACGACGGCAA AGGGGTCCGC ATGGTCGGAT CGGCTGAGGA AGCCGCCGAC GCCGCCGACT GGTTTGCGGC CATGACCCCG CTGCTGGCCG AGGCCAAGGT GGAGTTCAGC CGCGAACTGT CCGCACTCGT AGCGAGGACT CCTGACGGTG AATCCCGCGC CTGGCCCGTG GTCCACACCA TCCAGGTGGA CGGCGTCTGC GACGAAGTGA TCGCCCCGGC CCAGGACATT CCGCTTGAAG TCGCCGCGGC CGCCGAAGAC GCCGCAATCC GCATCGCCAA CGAACTCGGA GTCACCGGCG TCATGGCCGT GGAGCTCTTC GAAACCCCCG GCGTCGGCTC CGGCTTCCTG ATCAACGAGC TCGCAATGCG CCCGCACAAC ACCGGCCACT GGACCCAGGA CGGATCGGTC ACGAGCCAGT TCGAACAGCA CCTGCGTGCC GTGCTGAACC TCCCGCTCGG TGCCACCGAC GCTCTGGGAC AGATTGTTGT GATGAAGAAC TTCCTTGGCG GCGAGAACCA GGAACTGTTC TCGGCGTATC CGCTGGCCAT GGCCAGCGAG CCGGCCGCGA AGATCCACTG CTACGGCAAG GCCGTCAGGC CCGGCAGGAA GATCGGCCAC GTCAACCTGG TGGGGGCAGC CGCTTCCGAT GTCGACTCCG TCCGGCAGCG CGCCACCACC GTCGCCAACA TCATCAGGGA CGGCCGTGCT CCGGCCCGAC CTGCACCAGG GAACTCCGAG GAGACCGTAT GA
|
Protein sequence | MVKPGTPLLF CRLALVTFPV IGVVGGGQLA RMMAPAATAL GFELRVLAEG EDVSAVSAVP TSPVGDYKDL DALLEFSRGL DVMTFDHEHV PNDHLRALQE AGVNVQPGPD ALVHAQDKLV MRAAIDRLEL PNPAWASVAD VEALVAFGEK TGWPVVLKTP RGGYDGKGVR MVGSAEEAAD AADWFAAMTP LLAEAKVEFS RELSALVART PDGESRAWPV VHTIQVDGVC DEVIAPAQDI PLEVAAAAED AAIRIANELG VTGVMAVELF ETPGVGSGFL INELAMRPHN TGHWTQDGSV TSQFEQHLRA VLNLPLGATD ALGQIVVMKN FLGGENQELF SAYPLAMASE PAAKIHCYGK AVRPGRKIGH VNLVGAAASD VDSVRQRATT VANIIRDGRA PARPAPGNSE ETV
|
| |