Gene BURPS668_2643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2643 
SymbolargA 
ID4883377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2621402 
End bp2622778 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content69% 
IMG OID640128571 
ProductN-acetylglutamate synthase 
Protein accessionYP_001059668 
Protein GI126438827 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0548] Acetylglutamate kinase
[COG1246] N-acetylglutamate synthase and related acetyltransferases 
TIGRFAM ID[TIGR00761] acetylglutamate kinase
[TIGR01890] amino-acid N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.593428 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCCC AAACCGACCT TCCCGCCGCC CAGTCGGGCG CCGCGCCCCA GCCGTCCGTC 
GACGAAGCCG TCCATCACGC GCAGTTCGTC GACTGGATGC GCTCCGTCGC GCCCTATATC
CACAAGTTCC GCAACAGCAC CTTCGTGGTC GGCTTCGGCG GCGAGGTCGT GCAGCACGGG
CTGCTGAGCG CGCTCGTCTC CGACATCGCG CTGCTGCAGG CAATGGGCAT CCAGATCGTG
CTCGTGCACG GCTCGCGGCC GCAGGTCGAG GAGCAGCTGC ACCTGCACGG CGTCGAATCG
GCGTTCTCGC ACGGCCTGCG GATCACCGAC GCGCGCGCGC TCGAATCGGC GAAGGAAGCG
GCGGGCGAAG TGCGGCTCGA CATCGAGGCG GCGATCAGCC AGGGCTTGCC GAACACGCCG
ATGGCGCACG CGCACATCAG CGTCGTGTCG GGCAACTTCG TGACGGCGCG CCCGGTCGGC
ATTCTCGACG GCGTCGATTT CGCGCACACG GGCGTCGTGC GCAAGATCGA TGCCGAGTCG
ATCCGCCACT CGCTCGCGAG CCGCAAGCTC GTGCTGCTGT CACCGCTCGG CTTCTCGCCG
ACGGGCGAGG CGTTCAACCT GTCGATGGAG GATGTCGCGT CGGCGGCCGC GATCGCGCTG
CGCGCGGACA AGATCATCTT CCTGACCGAA TCGCCCGGCA TCGTCGACGA AGCGGGCGAG
CTCGTGCGCG AGATGTCGCT CGACGCGGCG GCCGAACTGC TCGACTCGGG CGATCTGCAA
GGCGACGACG GGTTCTTCCT GAAGCACGCG ATCCGCGCGT GCCGCGGCGG CGTCGCGCGC
GCGCACCTGA TACCGCAGTC GCTCGACGGC AGCGTGCTGC TCGAGCTGTT CCTGCACGAC
GGCGTGGGCA CGATGATCTC GTACGAGAAC CTCGAGAGCT TGCGCGAGGC GACGCCGGAC
GACGTGGGCG GCATCCTGAC CCTCATCGAG CCGCTCGAGA CGGACGGCAC GCTCGTGCGG
CGCGGCCGCC ACCAGATCGA GCGCGACATC GATCACTTCT CGGTCATCGA GCACGACGGC
GTGCTGTTCG GCTGCGCGGC GCTCTATCCG TACCCGACCG AGAAGATCGG CGAGATGGCG
TGTCTCACGG TCGCGCCCGA GGCGCAGGGC TCGGGCGACG GCGAGCGGCT CCTCAAGCGG
GTCGAGCAGC GCGCCCGCGC GCGCGGCCTC ACGCGGATCT TCGTGCTGAC GACGCGCACC
GAGCACTGGT TCCTCAAGCG CGGCTTCGTG AAGGCGACGG TCGACGACCT GCCCGAGGAC
CGCCGCAAGC TCTATAACTG GCAGCGCAAG TCGCTCGTGC TGATGAAGCA GCTCTGA
 
Protein sequence
MNSQTDLPAA QSGAAPQPSV DEAVHHAQFV DWMRSVAPYI HKFRNSTFVV GFGGEVVQHG 
LLSALVSDIA LLQAMGIQIV LVHGSRPQVE EQLHLHGVES AFSHGLRITD ARALESAKEA
AGEVRLDIEA AISQGLPNTP MAHAHISVVS GNFVTARPVG ILDGVDFAHT GVVRKIDAES
IRHSLASRKL VLLSPLGFSP TGEAFNLSME DVASAAAIAL RADKIIFLTE SPGIVDEAGE
LVREMSLDAA AELLDSGDLQ GDDGFFLKHA IRACRGGVAR AHLIPQSLDG SVLLELFLHD
GVGTMISYEN LESLREATPD DVGGILTLIE PLETDGTLVR RGRHQIERDI DHFSVIEHDG
VLFGCAALYP YPTEKIGEMA CLTVAPEAQG SGDGERLLKR VEQRARARGL TRIFVLTTRT
EHWFLKRGFV KATVDDLPED RRKLYNWQRK SLVLMKQL