Gene BMASAVP1_A1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A1074 
SymbolpurD 
ID4679718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp1056854 
End bp1058131 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content68% 
IMG OID639845348 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_992414 
Protein GI121600230 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.497421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTAC TCGTCGTCGG CTCCGGCGGC CGCGAACATG CGCTCGCCTG GAAGCTCGCC 
CAGTCGCCGC GCGTCCAGCT CGTCTACGTT GCGCCCGGCA ACGGCGGCAC CGCGCAGGAC
GAGCGCCTGA AGAACGTCGA TCTGAGCTCG CTCGACGATC TGGCCGATTT CGCCGAATCC
GAAGGCGTCG CGTTCACGCT CGTCGGCCCG GAAGCGCCGC TCGCGGCGGG CATCGTCAAT
CATTTCCGCG CGCGCGGCCT GAAGATCTTC GGCCCGACGA AGGAAGCGGC GCAGCTCGAG
AGCTCGAAGG ACTTCGCGAA GGCGTTCATG AAGCGCCACG GCATTCCGAC CGCCGATTAC
GAGACGTTCA CCGACGCGGC CGCCGCGCAC GCGTACGTCG ATTCGAAAGG CGCGCCGATC
GTCGTGAAGG CGGACGGCCT CGCCGCGGGC AAGGGCGTCG TCGTCGCGAT GACGCTGGAA
GAAGCGCACG CGGCCGTCGA CATGATGCTG TCCGGCAACA AGCTCGGCGA TGCCGGCGCG
CGCGTCGTGA TCGAGGCATT CCTCGACGGC GAGGAAGCGA GCTTCATCGT GATGGTCGAC
GGCAAGCACG CGCTCGCGCT CGCGTCGAGC CAGGATCACA AGCGTCTGCT CGACGAGGAC
CGCGGCCCGA ACACGGGCGG CATGGGCGCG TACTCGCCGG CGCCGATCGT CACGCCGCAG
ATGCATGCGC GCGTGATGCG CGAGATCATC ATGCCGACCG TGCGCGGGAT GGAGAAGGAC
GGCATCCGCT TCACGGGCTT CCTGTACGCG GGCCTGATGA TCGACCGCGA CGGCAATCCG
CGCACGCTCG AATTCAACTG CCGGATGGGC GACCCCGAAA CGCAGCCGAT CATGGCGCGC
CTGAAGAGCG ATTTCTCGAA GGTCGTCGAG CAGGCGATCG CGGGCACGCT CGATACCGTC
GAGCTCGACT GGGACCGCCG CACGGCGCTC GGCGTCGTGC TCGCCGCGCA CGGCTACCCC
GATGCGCCGC GCAAGGGCGA CCGGATCAAC GGCATCCCGG CCGGAACGGC GCACGCGGTG
ACGTTCCACG CGGGCACGAC GTTCGACGGC GACAAGCTCG TCACCTCGGG CGGGCGCGTG
CTGTGCGTGG TCGGCCTCGC GGATTCGGTG CGCGGCGCGC AGCAGGCCGC GTACGAGACG
ATCAACCAGA TCAACTTCGA AGGCATGCAG TATCGCCGCG ACATCGGCTA CCGCGCGCTC
AACCGCAAGA CGGTCTGA
 
Protein sequence
MKLLVVGSGG REHALAWKLA QSPRVQLVYV APGNGGTAQD ERLKNVDLSS LDDLADFAES 
EGVAFTLVGP EAPLAAGIVN HFRARGLKIF GPTKEAAQLE SSKDFAKAFM KRHGIPTADY
ETFTDAAAAH AYVDSKGAPI VVKADGLAAG KGVVVAMTLE EAHAAVDMML SGNKLGDAGA
RVVIEAFLDG EEASFIVMVD GKHALALASS QDHKRLLDED RGPNTGGMGA YSPAPIVTPQ
MHARVMREII MPTVRGMEKD GIRFTGFLYA GLMIDRDGNP RTLEFNCRMG DPETQPIMAR
LKSDFSKVVE QAIAGTLDTV ELDWDRRTAL GVVLAAHGYP DAPRKGDRIN GIPAGTAHAV
TFHAGTTFDG DKLVTSGGRV LCVVGLADSV RGAQQAAYET INQINFEGMQ YRRDIGYRAL
NRKTV