Gene Information |
Locus tag | SAG0045 |
Symbol | purK |
ID | 1012795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus agalactiae 2603V/R |
Kingdom | Bacteria |
Replicon accession | NC_004116 |
Strand | + |
Start bp | 60012 |
End bp | 61103 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637315200 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | NP_687081 |
Protein GI | 22536230 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACTCAT TTAAGACCAT TGGGATTATT GGTGGTGGTC AGCTGGGGCA GATGATGGCG ATTGCGGCTA TCTACATGGG CCACAAGGTC ATTACGCTGG ATCCAGCTAG CGACTGCCCT GCCTCCCGCG TTAGCGAGGT GATTGTGGCA CCTTACGATG ATGTTGAGGC TTTGGGAACA TTAGCTGCGC GTTGCGATGT TTTGACCTAT GAGTTTGAGA ATGTCGATGC CGATGGTCTG GATGCCGTTG TGTCAGCTGG TCAGCTACCG CAGGGGACTG ATCTGCTCCG CATTTCTCAA AACCGTATCT TTGAAAAAGA CTTTCTGGCA AATAAGGCTG GCGTGACTGT CGCTCCCTAT AAGGTGGTGA CATCTAGCCT TGACCTAGAG GGGCTTGACT TGACCAAGAC CTATGTCCTC AAGACAGCGA CAGGTGGTTA TGACGGTCAT GGGCAAAAGG TTATCCGCTC AGCAGAAGAC CTGCCAGAGG CGCAGCAATT AGCCAACTCA GCTCAGTGTG TCTTGGAAGA GTTTGTCAAC TTCGACCTTG AAATATCAGT CATCGTGTCT GGAAATGGTC AGGATGTGAC GGTCTTTCCC GTTCAGGAAA ATATCCACCG CAACAATATC CTGTCAAAAA CCATCGTACC AGCTCGCATC TCAGACCAAC TAGCTGACAA GGCTAAGGAA ATGGCTGTGC AGATTGCCAA GAAACTCCAG CTATCAGGAA CCCTCTGTGT GGAAATGTTT GCGACCGCAG ATGACATCAT CGTCAATGAA ATTGCCCCAC GTCCCCACAA CTCAGGGCAC TACTCTATCG AAGCCTGCGA CTTTTCACAG TTTGACACCC ACATCTTGGG CGTACTGGGC GCACCGCTTC CGCCAATCAA ACTCCATGCT CCAGCCGTTA TGTTCAATGT CCTAGGACAA CATGTCCAGC AGGCAATTGA CCATGTTGCC CAAAACCCTA GCGCCCACCT CCACATGTAT GGTAAACTAG AAGCAAAACA TAACCGCAAA ATGGGACACG TGACGGTGTT TAGCGATGTA CCTGATGAGG TGGAAGAGTT TGAAGAAAGG ATGGATTTCT AA
Protein sequence | MNSFKTIGII GGGQLGQMMA IAAIYMGHKV ITLDPASDCP ASRVSEVIVA PYDDVEALGT LAARCDVLTY EFENVDADGL DAVVSAGQLP QGTDLLRISQ NRIFEKDFLA NKAGVTVAPY KVVTSSLDLE GLDLTKTYVL KTATGGYDGH GQKVIRSAED LPEAQQLANS AQCVLEEFVN FDLEISVIVS GNGQDVTVFP VQENIHRNNI LSKTIVPARI SDQLADKAKE MAVQIAKKLQ LSGTLCVEMF ATADDIIVNE IAPRPHNSGH YSIEACDFSQ FDTHILGVLG APLPPIKLHA PAVMFNVLGQ HVQQAIDHVA QNPSAHLHMY GKLEAKHNRK MGHVTVFSDV PDEVEEFEER MDF
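
The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a short script. The sketch below is illustrative only and is not part of the source database. The helper names (`clean_sequence`, `gene_length`, `expected_protein_length`, `gc_percent`) are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon, both of which are consistent with the values listed above.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Values taken from the Gene Information fields of this record.
length = gene_length(60012, 61103)
print(length, expected_protein_length(length))  # prints: 1092 363
```

To check the GC content field, pass the full Gene sequence value to `gc_percent`; the function strips the block spacing itself.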