Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_0060 |
Symbol | |
ID | 4436857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 48320 |
End bp | 49411 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 639675824 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_819627 |
Protein GI | 116627008 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCAA CTAAAACCAT TGGTATCATC GGTGGTGGCC AGCTTGGTCA GATGATGGCC ATTTCTGCTA TCTATATGGG CCACAAGGTT ATCACCCTTG ATCCTGCATC AGATTGTCCA TCTTCTCGTG TGTCTGAGGT TATCGCGGCA CCCTACGATG ACGTAGATGC TCTTCGTCAG TTGGCGGACT GCTGTGATGT TCTCACTTAT GAATTTGAGA ATGTCGACGC TGACGGTCTT GACGCTGTCA TCAAGGATGG ACAACTTCCA CAAGGAACAG AACTGCTTCG CATTTCACAA AACCGTATCT TTGAGAAGGA CTTCCTTTCA AACAAGGCTC AAGTAACGGT GGCACCTTAC AAGGTCGTGA CCTCTAGCCT TGATTTGGAA GATATTGATC TTTCTAAAAA TTACGTCCTC AAGACTGCGA CAGGTGGTTA CGATGGCCAC GGTCAAAAAG TCATCACATC AGCCGAAGAT TTGGAAGAGG CAAATGCACT TGCTAACTCA GCTGAGTGTG TCTTGGAAGA GTTCGTCAAC TTCGACCTTG AAATTTCGGT TATCGTGTCA GGTAACGGCA AGGATGTGAC GGTTTTCCCA GTTCAGGAAA ATATCCACCG CAACAACATC CTCTCTAAGA CTATCGTTCC AGCTCGTATT TCTGATAGAC TAGCAGACAG AGCTAAAGCT ATTGCTGTGA AGATTGCTGA GCAACTTAAC CTCTCTGGTA CCCTTTGTGT AGAAATGTTT GCGACAGCTG ATGACATCAT TGTCAACGAA ATTGCGCCAC GCCCACACAA TTCAGGGCAC TACTCAATCG AAGCCTGCGA CTTTTCACAA TTTGACACAC ATATCTTGGG CGTTCTCGGA GCACCACTTC CAGCAATCAA CCTCCATGAA CCTGCTGTTA TGCTCAACGT CCTCGGCCAA CACGTCGAAG CAGCTGAGCG TTATGTCACA GAAAATCCAA GCGCCCACCT CCACATGTAT GGTAAACTAG AAGCGAAGCA CAACCGAAAG ATGGGTCATG TGACTTTGTT TAGTAATGAG CCAGATAATG TGGTTGAGTT TGGGAAAGGA ATTGATTTTT AG
|
Protein sequence | MSSTKTIGII GGGQLGQMMA ISAIYMGHKV ITLDPASDCP SSRVSEVIAA PYDDVDALRQ LADCCDVLTY EFENVDADGL DAVIKDGQLP QGTELLRISQ NRIFEKDFLS NKAQVTVAPY KVVTSSLDLE DIDLSKNYVL KTATGGYDGH GQKVITSAED LEEANALANS AECVLEEFVN FDLEISVIVS GNGKDVTVFP VQENIHRNNI LSKTIVPARI SDRLADRAKA IAVKIAEQLN LSGTLCVEMF ATADDIIVNE IAPRPHNSGH YSIEACDFSQ FDTHILGVLG APLPAINLHE PAVMLNVLGQ HVEAAERYVT ENPSAHLHMY GKLEAKHNRK MGHVTLFSNE PDNVVEFGKG IDF
|
| |