Gene SAG0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0045 
SymbolpurK 
ID1012795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp60012 
End bp61103 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content50% 
IMG OID637315200 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionNP_687081 
Protein GI22536230 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTCAT TTAAGACCAT TGGGATTATT GGTGGTGGTC AGCTGGGGCA GATGATGGCG 
ATTGCGGCTA TCTACATGGG CCACAAGGTC ATTACGCTGG ATCCAGCTAG CGACTGCCCT
GCCTCCCGCG TTAGCGAGGT GATTGTGGCA CCTTACGATG ATGTTGAGGC TTTGGGAACA
TTAGCTGCGC GTTGCGATGT TTTGACCTAT GAGTTTGAGA ATGTCGATGC CGATGGTCTG
GATGCCGTTG TGTCAGCTGG TCAGCTACCG CAGGGGACTG ATCTGCTCCG CATTTCTCAA
AACCGTATCT TTGAAAAAGA CTTTCTGGCA AATAAGGCTG GCGTGACTGT CGCTCCCTAT
AAGGTGGTGA CATCTAGCCT TGACCTAGAG GGGCTTGACT TGACCAAGAC CTATGTCCTC
AAGACAGCGA CAGGTGGTTA TGACGGTCAT GGGCAAAAGG TTATCCGCTC AGCAGAAGAC
CTGCCAGAGG CGCAGCAATT AGCCAACTCA GCTCAGTGTG TCTTGGAAGA GTTTGTCAAC
TTCGACCTTG AAATATCAGT CATCGTGTCT GGAAATGGTC AGGATGTGAC GGTCTTTCCC
GTTCAGGAAA ATATCCACCG CAACAATATC CTGTCAAAAA CCATCGTACC AGCTCGCATC
TCAGACCAAC TAGCTGACAA GGCTAAGGAA ATGGCTGTGC AGATTGCCAA GAAACTCCAG
CTATCAGGAA CCCTCTGTGT GGAAATGTTT GCGACCGCAG ATGACATCAT CGTCAATGAA
ATTGCCCCAC GTCCCCACAA CTCAGGGCAC TACTCTATCG AAGCCTGCGA CTTTTCACAG
TTTGACACCC ACATCTTGGG CGTACTGGGC GCACCGCTTC CGCCAATCAA ACTCCATGCT
CCAGCCGTTA TGTTCAATGT CCTAGGACAA CATGTCCAGC AGGCAATTGA CCATGTTGCC
CAAAACCCTA GCGCCCACCT CCACATGTAT GGTAAACTAG AAGCAAAACA TAACCGCAAA
ATGGGACACG TGACGGTGTT TAGCGATGTA CCTGATGAGG TGGAAGAGTT TGAAGAAAGG
ATGGATTTCT AA
 
Protein sequence
MNSFKTIGII GGGQLGQMMA IAAIYMGHKV ITLDPASDCP ASRVSEVIVA PYDDVEALGT 
LAARCDVLTY EFENVDADGL DAVVSAGQLP QGTDLLRISQ NRIFEKDFLA NKAGVTVAPY
KVVTSSLDLE GLDLTKTYVL KTATGGYDGH GQKVIRSAED LPEAQQLANS AQCVLEEFVN
FDLEISVIVS GNGQDVTVFP VQENIHRNNI LSKTIVPARI SDQLADKAKE MAVQIAKKLQ
LSGTLCVEMF ATADDIIVNE IAPRPHNSGH YSIEACDFSQ FDTHILGVLG APLPPIKLHA
PAVMFNVLGQ HVQQAIDHVA QNPSAHLHMY GKLEAKHNRK MGHVTVFSDV PDEVEEFEER
MDF