Gene BURPS1106A_2016 details

Gene Information

Locus tag: BURPS1106A_2016
Symbol: argG
ID: 4902486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1978941
End bp: 1980155
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 66%
IMG OID: 640135246
Product: argininosuccinate synthase
Protein accession: YP_001066281
Protein GI: 126451449
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0137] Argininosuccinate synthase
TIGRFAM ID: [TIGR00032] argininosuccinate synthase
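
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
START_BP = 1978941
END_BP = 1980155


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1215
print(protein_length(length_bp))  # 404
```

Under these assumptions the results agree with the Gene Length (1215 bp) and Protein Length (404 aa) fields.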


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000804601
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCCA AACACATTCT CCTCGCTTAT TCCGGCGGCC TCGACACGTC CACCGCGCTG 
CACTTCCTGA AGCGGCATTT CGATTGCCGC GTCACCGCCT ATTGCGCGAA CCTCGGGCAG
AAGGAGGATT GGGAGCGGAT GAAACGCCGC GCGGCGATCG CCGGCGCGGA CGAGCTGGTC
ATCGAGGATC TGCGCGAGAC GTTCATCGGC GATTTCGTGT TTCCCGCACT GAAGGCGAAC
GCGTCGTACG AGCGGGACTA TCTGCTCGGC ACGCCGCTCG CCCGCCCGGC GATCGTCAAG
GGGCTCATCG AGTACGCGCG CAAGCACGAC GTCGATTGCC TGTCGCACGG CTGCACGCAG
AAGGGCAACG ATCAGGTGCG CTTCGAGATG GCCGCGAAGA TTCTCGCGCC CGATCTGCCG
ACGGTCGCGC CGTGGCGCAT CTGGTCGCTG CAGTCGCGCG AGGATCTGTT CGCGTATTGT
CAGCAGCACG GCATTCCGGT CGAAAGCCGT CCGGACAATC TGTTGAGCCA CGACGAGAAT
CTCGTGCACA TCACGACGGA GGGCGACTAT CTGGAGAGCG TCGCGAACGC GTTCGACTGG
CGCGACGCGA ACTGGATCAC GCCGCCCACG CAAGCGCCGG ATGCGATCGA GACGATCACG
CTCGGGTTCC GCCGGGGCGT GCCCGTCAGC GTCGACGGCG CGGCGCTCGG GCCGGTCGAG
CTGGTCGAGC GGCTCAACGA AGCGGGCGCC CGCAACGGCG TCGGCTTCCA GGACATCATC
GAGAACCGCA TCAACGGCCT GAAGGTGCGC GGCGTGTTCG AGAACCCCGC GCTGACGATC
CTGCACGCCG CGCATCGCAA GCTCGAGAAG ATCACGCTCG GCCGCGACGT CGAGCGCCTG
CGCAACCTCG TGTCGGACGA CTACGGCGAC ATCGTCTACC GCGGCCTGTG GTTCAGCGAC
GAGCGGCTCT GCCTGCAGGC GCTCATCGAC GAATCGCAGA AGTACGTGAG CGGCGACGTG
AAGGTTCAGC TCTACAAGGG TTCGTGCACG CCGTGCGCCG TCGAATCGGA GCAGTCGCTT
TATTCGCGCG AGCTCGTGAC GCTGCACGCG GGCCGCGCGA TCAGCGGCGA GGACGCGACG
GGCTTCCTGA ACACGCTCGG CCTGCGTATC GGCATCGAAG CCGCGCGCGC CGGCAACACG
GGAGCCGGCG CATGA
 
Protein sequence
MKPKHILLAY SGGLDTSTAL HFLKRHFDCR VTAYCANLGQ KEDWERMKRR AAIAGADELV 
IEDLRETFIG DFVFPALKAN ASYERDYLLG TPLARPAIVK GLIEYARKHD VDCLSHGCTQ
KGNDQVRFEM AAKILAPDLP TVAPWRIWSL QSREDLFAYC QQHGIPVESR PDNLLSHDEN
LVHITTEGDY LESVANAFDW RDANWITPPT QAPDAIETIT LGFRRGVPVS VDGAALGPVE
LVERLNEAGA RNGVGFQDII ENRINGLKVR GVFENPALTI LHAAHRKLEK ITLGRDVERL
RNLVSDDYGD IVYRGLWFSD ERLCLQALID ESQKYVSGDV KVQLYKGSCT PCAVESEQSL
YSRELVTLHA GRAISGEDAT GFLNTLGLRI GIEAARAGNT GAGA
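
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which start codons are permitted), and it builds the codon table from the usual compact TCAG-ordered string. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 matches the standard code for codon-to-amino-acid
# assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "ATGAAACCCA AACACATTCT CCTCGCTTAT TCCGGCGGCC TCGACACGTC CACCGCGCTG"
print(translate(first_row))  # MKPKHILLAYSGGLDTSTAL
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.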