Gene SAG0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0047 
SymbolpurB 
ID1012797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp62573 
End bp63871 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content45% 
IMG OID637315202 
Productadenylosuccinate lyase 
Protein accessionNP_687083 
Protein GI22536232 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.727181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAAC GTTATTCACG CCCTGAGATG GCGGCAATTT GGACAGAGGA AAATAAATAC 
CGTGCTTGGT TGGAGGTCGA GATTTTGGCT GACGAGGCAT GGGCTGAGTT GGGTGAGATT
CCTAAGGAGG ATGTGGCTAA GATTCGTGAG AAGGCGGATT TTGACATTGA CCGCATTCTT
GAGATTGAGC AGGACACGCG TCACGATGTG GTGGCTTTCA CTCGTGCGGT TTCTGAGACG
CTTGGTGAGG AGCGCAAGTG GGTGCACTAC GGTTTGACGT CGACTGACGT GGTGGACACT
GCCTACGGTT ACCTCTACAA GCAGGCTAAC GATATTATCC GTCGTGACCT TGAGAATTTC
ACAAATATTG TGGCTGATAA GGCTAAGGAG CACAAGTTCA CCATCATGAT GGGTCGTACC
CACGGTGTTC ACGCTGAGCC AACGACTTTC GGTCTTAAGT TGGCGACCTG GTACAGCGAG
ATGAAACGTA ATATTGAGCG TTTTGAACAT GCTGCCGCAG GTGTGGAAGC TGGTAAGATT
TCAGGTGCCG TTGGTAACTT TGCTAACATT CCTCCATTTG TGGAACAATA TGTTTGTGAC
AAATTGGGCA TCCGTCCGCA AGAGATTTCA ACACAGGTTC TTCCACGTGA CCTCCACGCA
GAATATTTTG CAGTGCTTGC AAGCATTGCA ACTTCTATCG AACGTATGGC GACAGAGATT
CGTGGTCTGC AAAAATCAGA ACAACGTGAA GTTGAAGAAT TCTTTGCCAA AGGTCAGAAA
GGTAGCTCTG CTATGCCTCA CAAACGCAAC CCAATCGGTT CAGAGAACAT GACTGGGCTA
GCGCGCGTGA TTCGTGGTCA CATGGTGACG GCTTATGAGA ACGTGGCACT TTGGCACGAG
CGTGATATTT CGCACTCATC TGCTGAGCGT ATCATCACAC CTGACACAAC GATCTTGATT
GACTACATGC TCAACCGTTT TGGCAATATC GTTAAGAACT TGACTGTCTT CCCGGAAAAT
ATGATGCGCA ATATGGAATC AACTTTTGGT TTGATTTATA GTCAACGTGT TATGCTCAAA
TTGATTGAAA AAGGAATGAC ACGAGAAGAA GCTTATGACT TAGTTCAACC TAAGACAGCT
TATTCCTGGG ACAATCAAGT GGATTTCAAA CCACTTTTAG AAGAAGACAC CAAAGTTACC
TCTTGTCTTA CACAGGAAGA AATTGATGAA CTATTTAATC CGATTTATTA CACAAAACGT
GTTGATGATA TTTTTGAAAG ACTAGGATTA GAAAAATAA
 
Protein sequence
MIERYSRPEM AAIWTEENKY RAWLEVEILA DEAWAELGEI PKEDVAKIRE KADFDIDRIL 
EIEQDTRHDV VAFTRAVSET LGEERKWVHY GLTSTDVVDT AYGYLYKQAN DIIRRDLENF
TNIVADKAKE HKFTIMMGRT HGVHAEPTTF GLKLATWYSE MKRNIERFEH AAAGVEAGKI
SGAVGNFANI PPFVEQYVCD KLGIRPQEIS TQVLPRDLHA EYFAVLASIA TSIERMATEI
RGLQKSEQRE VEEFFAKGQK GSSAMPHKRN PIGSENMTGL ARVIRGHMVT AYENVALWHE
RDISHSSAER IITPDTTILI DYMLNRFGNI VKNLTVFPEN MMRNMESTFG LIYSQRVMLK
LIEKGMTREE AYDLVQPKTA YSWDNQVDFK PLLEEDTKVT SCLTQEEIDE LFNPIYYTKR
VDDIFERLGL EK