Gene SNSL254_A2695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2695 
SymbolpurM 
ID6484212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2613803 
End bp2614855 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content56% 
IMG OID642738027 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_002041761 
Protein GI194442301 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGAACC AGGCAGTGAC CGATAAGACC TCTCTTAGCT ATAAAGATGC CGGCGTCGAT 
ATTGATGCGG GTAACGCTCT GGTTGATCGA ATCAAAGGCG TAGTGAAGAA AACTCGCCGC
CCGGAGGTTA TGGGCGGTCT GGGCGGTTTC GGTGCGCTGT GCGCGTTGCC GCAAAAATAT
CGTGAACCGG TACTGGTTTC CGGCACTGAC GGCGTAGGCA CCAAACTTCG CCTGGCGATG
GACTTAAAGC GTCACGACGC TATCGGTATT GATCTGGTGG CGATGTGCGT AAACGATCTG
GTCGTTCAGG GCGCGGAACC GCTGTTTTTC CTCGATTACT ATGCCACGGG TAAACTGGAT
GTCGATACCG CCGCCAGCGT GATCAACGGT ATTGCCGAAG GCTGCCTGCA ATCCGGCTGC
GCGCTGGTCG GCGGCGAGAC GGCGGAAATG CCGGGCATGT ATCACGGCGA AGATTACGAT
GTGGCGGGTT TCTGCGTCGG CGTAGTCGAA AAATCAGAAA TCATCGACGG CTCCCGGGTT
GCCGAAGGCG ACGTGCTGAT TGCACTCGGC TCCAGCGGCC CGCACTCGAA TGGATATTCG
CTGGTGCGGA AAATTATTGA CGTTAGCGGC TGCGACCCAC AAACCACTCT GCTGGAAGGG
AAGCCGCTGG CCGATCATCT GCTTGAACCG ACCCGTATCT ACGTAAAATC GGTTCTGGAA
CTGATTGAAA ACGTCGATGT ACACGCTATC GCCCACCTCA CCGGCGGGGG CTTTTGGGAA
AATATTCCGC GCGTTCTGCC GGAGAATACC CAGGCGGTAA TTAATGAGTC GTCCTGGCAG
TGGCCCGCCA TCTTTACCTG GCTGCAAACC GCCGGTAATG TCAGCCGACA TGAAATGTAC
CGTACCTTTA ACTGCGGCGT CGGCATGGTG ATTGCGCTCT CCGCTCCGGA GGCGGACAAA
GCGCTTGCTC TGCTAAACGA GAAAGGTGAA AACGCATGGA AAATCGGTAT CATCAAAGCC
TCTGATTCCG AACAGCGTGT GGTTATTGAA TAA
 
Protein sequence
MGNQAVTDKT SLSYKDAGVD IDAGNALVDR IKGVVKKTRR PEVMGGLGGF GALCALPQKY 
REPVLVSGTD GVGTKLRLAM DLKRHDAIGI DLVAMCVNDL VVQGAEPLFF LDYYATGKLD
VDTAASVING IAEGCLQSGC ALVGGETAEM PGMYHGEDYD VAGFCVGVVE KSEIIDGSRV
AEGDVLIALG SSGPHSNGYS LVRKIIDVSG CDPQTTLLEG KPLADHLLEP TRIYVKSVLE
LIENVDVHAI AHLTGGGFWE NIPRVLPENT QAVINESSWQ WPAIFTWLQT AGNVSRHEMY
RTFNCGVGMV IALSAPEADK ALALLNEKGE NAWKIGIIKA SDSEQRVVIE