Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2695 |
Symbol | purM |
ID | 6484212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2613803 |
End bp | 2614855 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642738027 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_002041761 |
Protein GI | 194442301 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGAACC AGGCAGTGAC CGATAAGACC TCTCTTAGCT ATAAAGATGC CGGCGTCGAT ATTGATGCGG GTAACGCTCT GGTTGATCGA ATCAAAGGCG TAGTGAAGAA AACTCGCCGC CCGGAGGTTA TGGGCGGTCT GGGCGGTTTC GGTGCGCTGT GCGCGTTGCC GCAAAAATAT CGTGAACCGG TACTGGTTTC CGGCACTGAC GGCGTAGGCA CCAAACTTCG CCTGGCGATG GACTTAAAGC GTCACGACGC TATCGGTATT GATCTGGTGG CGATGTGCGT AAACGATCTG GTCGTTCAGG GCGCGGAACC GCTGTTTTTC CTCGATTACT ATGCCACGGG TAAACTGGAT GTCGATACCG CCGCCAGCGT GATCAACGGT ATTGCCGAAG GCTGCCTGCA ATCCGGCTGC GCGCTGGTCG GCGGCGAGAC GGCGGAAATG CCGGGCATGT ATCACGGCGA AGATTACGAT GTGGCGGGTT TCTGCGTCGG CGTAGTCGAA AAATCAGAAA TCATCGACGG CTCCCGGGTT GCCGAAGGCG ACGTGCTGAT TGCACTCGGC TCCAGCGGCC CGCACTCGAA TGGATATTCG CTGGTGCGGA AAATTATTGA CGTTAGCGGC TGCGACCCAC AAACCACTCT GCTGGAAGGG AAGCCGCTGG CCGATCATCT GCTTGAACCG ACCCGTATCT ACGTAAAATC GGTTCTGGAA CTGATTGAAA ACGTCGATGT ACACGCTATC GCCCACCTCA CCGGCGGGGG CTTTTGGGAA AATATTCCGC GCGTTCTGCC GGAGAATACC CAGGCGGTAA TTAATGAGTC GTCCTGGCAG TGGCCCGCCA TCTTTACCTG GCTGCAAACC GCCGGTAATG TCAGCCGACA TGAAATGTAC CGTACCTTTA ACTGCGGCGT CGGCATGGTG ATTGCGCTCT CCGCTCCGGA GGCGGACAAA GCGCTTGCTC TGCTAAACGA GAAAGGTGAA AACGCATGGA AAATCGGTAT CATCAAAGCC TCTGATTCCG AACAGCGTGT GGTTATTGAA TAA
|
Protein sequence | MGNQAVTDKT SLSYKDAGVD IDAGNALVDR IKGVVKKTRR PEVMGGLGGF GALCALPQKY REPVLVSGTD GVGTKLRLAM DLKRHDAIGI DLVAMCVNDL VVQGAEPLFF LDYYATGKLD VDTAASVING IAEGCLQSGC ALVGGETAEM PGMYHGEDYD VAGFCVGVVE KSEIIDGSRV AEGDVLIALG SSGPHSNGYS LVRKIIDVSG CDPQTTLLEG KPLADHLLEP TRIYVKSVLE LIENVDVHAI AHLTGGGFWE NIPRVLPENT QAVINESSWQ WPAIFTWLQT AGNVSRHEMY RTFNCGVGMV IALSAPEADK ALALLNEKGE NAWKIGIIKA SDSEQRVVIE
|
| |