Gene BURPS1106A_0842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0842 
SymbolpurC 
ID4901865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp823001 
End bp823891 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content66% 
IMG OID640134072 
Productphosphoribosylaminoimidazole-succinocarboxamide synthase 
Protein accessionYP_001065123 
Protein GI126455088 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0152] Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase 
TIGRFAM ID[TIGR00081] phosphoribosylaminoimidazole-succinocarboxamide synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.818868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACCC TTTACGAATC CACGCTGCGC TCGCTGCCGC TCCTCGGTCG CGGCAAGGTC 
CGCGACAACT ACGCGCTCGG CAACGACAAG CTCCTGATCG TCACGACCGA TCGCCTGTCG
GCGTTCGACG TCATCATGGG CGAGCCGATT CCGAACAAGG GCCGCGTGCT GAACCAGATG
GCGAACTTCT GGTTCGACAG GCTCGCGCAC ATCGTCCCGA ACCATCTGAC GGGCGTCGCG
CCCGAGACGG TCGTCGCCGC CGACGAGGTC GAGCAGGTGA AGGGGCGCGC GGTCGTCGTC
AAGCGGCTCG AGCCGATCCT CGTCGAGGCG GTCGTGCGCG GCTATCTGGC GGGCAGCGGC
TGGAAGGACT ACCAGGCGAC GGGCAAGGTG TGCGGCGTCG AGCTGCCGGC CGGCCTGTCG
AACGCGCAGA AGCTCCCCGA GCCGATCTTC ACGCCCGCCG CGAAGGCCGA GATGGGCCAT
CACGACGAGA ACATCTCGTT CGAGGAAACC GAGCGGCGCA TCGGCACCGA GCTCGCCGCG
ACGATTCGCG ACATCTCGAT CAGGCTGTAC AAGGAAGCGG CCGATTACGC GGCGACGCGC
GGCATCATCA TCGCCGACAC GAAGTTCGAG TTCGGCCTCG ACGAGCACGG CGAGCTGTTC
CTGATGGACG AGGCGTTGAC GGCCGATTCG TCGCGCTTCT GGCCGGCGGA CGAATACCGG
GTCGGCACGA ACCCGCCGTC GTTCGACAAG CAGTTCGTCC GCGACTGGCT CGAGGCGCAG
AACTGGAACA AGGCGCCGCC CGCGCCGAAG CTGCCCGACG ATGTGGTCGC GAAGACGAGC
GCGAAGTATC AGGAAGCGCT CGAGCGCATC ACGGGCAAGA CGCTCGACTG A
 
Protein sequence
MSTLYESTLR SLPLLGRGKV RDNYALGNDK LLIVTTDRLS AFDVIMGEPI PNKGRVLNQM 
ANFWFDRLAH IVPNHLTGVA PETVVAADEV EQVKGRAVVV KRLEPILVEA VVRGYLAGSG
WKDYQATGKV CGVELPAGLS NAQKLPEPIF TPAAKAEMGH HDENISFEET ERRIGTELAA
TIRDISIRLY KEAADYAATR GIIIADTKFE FGLDEHGELF LMDEALTADS SRFWPADEYR
VGTNPPSFDK QFVRDWLEAQ NWNKAPPAPK LPDDVVAKTS AKYQEALERI TGKTLD