Gene SAG0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0043 
SymbolpurD 
ID1012793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp57991 
End bp59256 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content50% 
IMG OID637315198 
Productphosphoribosylamine--glycine ligase 
Protein accessionNP_687079 
Protein GI22536228 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTGC TTGTTGTTGG TTCTGGTGGT CGTGAGCATG CGATTGCTAA GAAGTTGTTA 
GCGTCTAAGG ATGTGGATCA GGTTTTTGTG GCACCTGGTA ATGATGGTAT GACCTTGGAT
GGTCTAGACT TGGTAAATAT CGGAATTTCC GAACATTCTA GACTGATTGA CTTTGTTAAG
GAGAATGAGA TTGCTTGGAC CCTTATTGGT CCTGATGATG CGCTAGCAGC TGGTATCGTT
GATGGTTTTA ATAGTGCTGG ACTCAGAGCA TTTGGTCCAA CCAAGGCAGC CGCGGAGCTA
GAGTGGTCAA AAGACTTTGC CAAGGAAATC ATGGTCAAAT ACAATGTTCC AACAGCAGCC
TATGGCACAT TTTCAGATTT TGAAAAAGCT AAAGCCTACA TCGAAGAGCA GGGCGCACCA
ATCGTGGTCA AGGCTGACGG ATTGGCGTTA GGCAAGGGCG TGGTCGTGGC TGAAACCGTT
GAGCAGGCGG TAGAGGCGGC GCAAGAGATG CTTTTGGACA ACAAGTTTGG CGACTCGGGT
GCGCGCGTGG TTATCGAGGA ATTCTTGGAT GGTGAAGAGT TCTCCCTTTT CGCCTTCGCT
AATGGCGATA AGTTCTACAT CATGCCGACA GCTCAGGATC ACAAGCGTGC CTATGATGGT
GACAAGGGGC TAAATACCGG TGGTATGGGT GCCTATGCGC CAGTTCCCCA CCTGCCTCAG
AGCGTGGTGG ATACAGCAGT TGAGACTATC GTTAAGCCTG TCCTTGAAGG CATGATTGCC
GAAGGGCGTC CTTATCTAGG TGTCCTCTAT GCTGGGCTTA TCCTGACGGC TGATGGCCCT
AAGGTTATCG AGTTCAACTC ACGTTTTGGT GACCCTGAAA CTCAGATTAT CCTCCCTCGC
CTGACTTCCG ATTTCGCTCA GAACATCGAC GACATCATGA TGGGCATCGA GCCTTACATC
ACTTGGCAGA AGGACGGCGT GACTCTGGGC GTTGTCGTTG CCTCAGAAGG CTATCCGCTC
GATTACGAGA AAGGTGTGCC ACTGCCTGAA AAGACCGACG GCGACATCAT CACCTACTAT
GCGGGAGCTA AGTTTGCGGA AAATAGCAAA GCACTGCTCT CAAACGGAGG ACGTGTCTAC
ATGCTTGTCA CCACAGAAGA CAGCGTCAAA GCAGGGCAGG ACAAAATCTA TACCCAACTC
GCCCAACAAG ACACAACAGG CCTCTTCTAC CGAAACGACA TCGGAAGCAA AGCTATTAAG
GAATAA
 
Protein sequence
MKLLVVGSGG REHAIAKKLL ASKDVDQVFV APGNDGMTLD GLDLVNIGIS EHSRLIDFVK 
ENEIAWTLIG PDDALAAGIV DGFNSAGLRA FGPTKAAAEL EWSKDFAKEI MVKYNVPTAA
YGTFSDFEKA KAYIEEQGAP IVVKADGLAL GKGVVVAETV EQAVEAAQEM LLDNKFGDSG
ARVVIEEFLD GEEFSLFAFA NGDKFYIMPT AQDHKRAYDG DKGLNTGGMG AYAPVPHLPQ
SVVDTAVETI VKPVLEGMIA EGRPYLGVLY AGLILTADGP KVIEFNSRFG DPETQIILPR
LTSDFAQNID DIMMGIEPYI TWQKDGVTLG VVVASEGYPL DYEKGVPLPE KTDGDIITYY
AGAKFAENSK ALLSNGGRVY MLVTTEDSVK AGQDKIYTQL AQQDTTGLFY RNDIGSKAIK
E