Gene SAG0030 details


Gene Information

Locus tag: SAG0030
Symbol: purH
ID: 1012780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 44735
End bp: 46282
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 47%
IMG OID: 637315185
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: NP_687066
Protein GI: 22536215
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
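
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record for SAG0030 (purH).
length_bp = gene_length(44735, 46282)
print(length_bp)                  # 1548
print(protein_length(length_bp))  # 515
```

Both results match the reported Gene Length (1548 bp) and Protein Length (515 aa).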


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTAAAC GTGCTTTAAT CTCAGTTTCT GACAAATCAG GAATTATTGA CTTTGCAAAA 
GAATTGAAAA ACTTGGGTTG GGATATTATC TCAACTGGTG GGACTAAGGT TGCCCTTGAT
GATGCTGGTG TTGAGACCAT TGCCATCGAC GATGTGACTG GATTCCCAGA AATGATGGAC
GGTCGTGTTA AGACCCTCCA CCCAAACATT CACGGTGGGC TTCTGGCTCG TCGCGACGCT
GACAGCCACC TTCAGGCTGC TAAGGACAAC AATATTGAGT TGATTGACCT CGTGGTTGTC
AACCTCTATC CCTTCAAGGA GACCATCCTT CGCCCAGACG TGACCTACGA TTTGGCGGTG
GAAAATATCG ACATCGGCGG TCCATCAATG CTTCGCTCAG CTGCTAAAAA CCACGCTAGC
GTAACCGTTG TGGTTGATTC AGCTGACTAT GCCACTGTTT TGGGAGAATT GGCTGACGCT
AGTCAGACGA CATTTAAAAC TCGTCAACGC TTGGCAGCTA AGGCCTTCCG TCATACGGCA
GCCTACGACG CTTTGATTGC TGAGTACTTC ACAGCTCAAG TGGGAGAGGC TAAGCCTGAA
AAATTGACCA TCACTTATGA CCTTAAACAG GCTATGCGTT ACGGAGAAAA TCCACAGCAA
GACGCTGATT TCTACCAAAA AGCCTTACCA ACAGACTACT CAATCGCTTC AGCTAAACAG
CTCAATGGAA AAGAATTGTC CTTCAATAAT ATCCGTGATG CTGATGCAGC AATCCGTATT
ATCCGCGATT TCAAAGACAG TCCAACGGTT GTTGCCCTCA AACACATGAA CCCATGTGGT
ATCGGACAGG CTGATGATAT TGAGACAGCT TGGGATTACG CTTATGAAGC TGATCCAGTT
TCAATCTTTG GTGGAATTGT TGTCCTTAAC CGTGAAGTTG ACGCAGCGAC AGCTGAGAAG
ATGCACCCTA TCTTCTTGGA AATCATCATC GCACCATCAT ACTCAGAAGA AGCGCTAGCT
ATTCTCACAA ATAAAAAGAA AAACTTGCGT ATCCTTGAGT TGCCGTTTGA TGCCCAAGCT
GCCAGCGAAG TGGAAGCTGA GTACACTGGC GTAGTTGGTG GACTTTTGGT GCAAAACCAA
GACGTTGTGG CTGAAAATCC ATCTGACTGG CAAGTGGTGA CAGACCGCCA GCCAACAGAA
CAAGAGGCGA CTGCCCTTGA GTTTGCCTGG AAGGCTATCA AGTATGTTAA GTCTAACGGG
ATTATTATTA CTAACGATCA CATGACGCTT GGACTCGGTG CAGGTCAAAC CAACCGTGTC
GGCTCAGTCA AGATTGCTAT CGAGCAGGCT AAGGACCACC TTGACGGTGC CGTTCTAGCA
TCAGATGCCT TCTTCCCATT TGCGGACAAC ATTGAAGAAA TCGCTGCCGC AGGGATCAAA
GCAATCATCC AACCAGGTGG TTCAGTTCGT GACCAAGAAT CTATTGACGC CGCAAACAAA
CATGGCTTGA CCATGATCTT CACAGGCGTG AGACATTTTA GACATTAA
 
Protein sequence
MTKRALISVS DKSGIIDFAK ELKNLGWDII STGGTKVALD DAGVETIAID DVTGFPEMMD 
GRVKTLHPNI HGGLLARRDA DSHLQAAKDN NIELIDLVVV NLYPFKETIL RPDVTYDLAV
ENIDIGGPSM LRSAAKNHAS VTVVVDSADY ATVLGELADA SQTTFKTRQR LAAKAFRHTA
AYDALIAEYF TAQVGEAKPE KLTITYDLKQ AMRYGENPQQ DADFYQKALP TDYSIASAKQ
LNGKELSFNN IRDADAAIRI IRDFKDSPTV VALKHMNPCG IGQADDIETA WDYAYEADPV
SIFGGIVVLN REVDAATAEK MHPIFLEIII APSYSEEALA ILTNKKKNLR ILELPFDAQA
ASEVEAEYTG VVGGLLVQNQ DVVAENPSDW QVVTDRQPTE QEATALEFAW KAIKYVKSNG
IIITNDHMTL GLGAGQTNRV GSVKIAIEQA KDHLDGAVLA SDAFFPFADN IEEIAAAGIK
AIIQPGGSVR DQESIDAANK HGLTMIFTGV RHFRH
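
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. The following is a minimal sketch, not the tool that produced this record. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it ignores the spaces and line breaks used to group the sequence into blocks of ten. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 shares these
# amino-acid assignments with the standard genetic code.
_AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
                "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display lines of the gene sequence above.
first_line = "ATGACTAAAC GTGCTTTAAT CTCAGTTTCT GACAAATCAG GAATTATTGA CTTTGCAAAA"
last_line = "CATGGCTTGA CCATGATCTT CACAGGCGTG AGACATTTTA GACATTAA"

print(translate(first_line))  # MTKRALISVSDKSGIIDFAK
print(translate(last_line))   # HGLTMIFTGVRHFRH
```

The two outputs correspond to the first 20 and last 15 residues of the protein sequence shown above. Passing the full gene sequence to `gc_percent` would let a reader compare the result with the GC content reported in the record.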