Gene SAG0027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0027 
SymbolpurM 
ID1012777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp42202 
End bp43224 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content45% 
IMG OID637315182 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionNP_687063 
Protein GI22536212 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAA AAAATGCTTA TGCCCAGTCT GGTGTTGATG TAGAAGCGGG CTACGAAGTT 
GTCGAACGTA TCAAGAAACA CGTTGCTCGC ACAGAACGTG CTGGTGTCAT GGGAGCTCTC
GGTGGCTTTG GTGGTATGTT TGACCTGTCA CAAACAGGTG TTAAAGAACC TGTCTTGATT
TCAGGGACTG ACGGTGTCGG AACTAAACTC ATGCTTGCTA TCAAGTACGA CAAGCACGAC
ACAATCGGTC AAGACTGTGT TGCCATGTGT GTCAACGATA TTATTGCAGC AGGTGCTGAG
CCCCTTTACT TCCTTGACTA TGTCGCGACT GGTAAAAACG AACCTGCCAA ATTGGAACAG
GTTGTCGCTG GTGTTGCTGA AGGTTGTGTT CAAGCTAGCG CAGCTCTTAT CGGTGGTGAA
ACGGCTGAGA TGCCTGGTAT GTATGGCGAA GATGATTATG ACCTTGCAGG CTTTGCTGTT
GGTGTGGCTG AAAAATCTCA AATCATCGAC GGTTCAAAGG TAAAAGAAGG GGATATTCTT
CTTGGACTTG CTTCAAGTGG TATCCATTCA AATGGTTATT CATTGGTACG TCGTGTCTTT
GCTGACTACA CTGGTGATGA GGTGCTTCCA GAGCTTGAAG GCAAACAACT CAAGGATGTC
CTTCTTGAGC CAACTCGTAT CTATGTTAAA GCAGCTCTGC CATTGATCAA GGAAGAACTA
GTTAACGGTA TCGCCCACAT CACGGGTGGT GGTTTTATCG AGAATGTTCC TCGTATGTTT
GCGGATGATT TGGCTGCGGA AATCGATGAG GATAAGGTGC CTGTACTTCC GATTTTCAAG
GCGCTTGAAA AATATGGTGA CATCAAGCAC GAAGAAATGT TTGAAATCTT CAACATGGGT
GTCGGTCTTA TGCTGGATGT TAACCCTGAA AATGTTGACC GTGTCAAAGA ACTTTTGGAC
GAACCAGTCT ATGAAATCGG TCGTATCATC AAGAAAGCAG ACGATAGTGT GGTGATTAAA
TAA
 
Protein sequence
MSEKNAYAQS GVDVEAGYEV VERIKKHVAR TERAGVMGAL GGFGGMFDLS QTGVKEPVLI 
SGTDGVGTKL MLAIKYDKHD TIGQDCVAMC VNDIIAAGAE PLYFLDYVAT GKNEPAKLEQ
VVAGVAEGCV QASAALIGGE TAEMPGMYGE DDYDLAGFAV GVAEKSQIID GSKVKEGDIL
LGLASSGIHS NGYSLVRRVF ADYTGDEVLP ELEGKQLKDV LLEPTRIYVK AALPLIKEEL
VNGIAHITGG GFIENVPRMF ADDLAAEIDE DKVPVLPIFK ALEKYGDIKH EEMFEIFNMG
VGLMLDVNPE NVDRVKELLD EPVYEIGRII KKADDSVVIK