Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1943 |
Symbol | arsB |
ID | 4905044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1899341 |
End bp | 1900411 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640145049 |
Product | arsenical-resistance protein |
Protein accession | YP_001075977 |
Protein GI | 126455737 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.926392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCAT CCAACGTCGC CCCGCCGGAA AACGCCGCCG CCCAGCCCGC CATCAACTTC TTCGAACGTT ACCTGACCGT CTGGGTGGCG CTGTGCATCG TCGCCGGCAT CGCGCTCGGC CAGGCGCGGC CCGATCTGTT CCGGCAGATC GGCCGGATGG AATACGCGCA AGTGAATCTC CCGGTCGGGC TGCTGATCTG GGTGATGATC ATCCCGATGC TCGTCAAGGT CGATTTCGGC GCATTGCACG AAGTCCGCCG GCACGTCAAG GGCATCGGCG TCACGCTCGT CGTGAACTGG CTCGTCAAGC CGTTTTCGAT GGCCTTTCTC GGCTGGCTGT TCATCAAGCA GTTCTTCGCG CCGATGCTGC CCGCGGCGCA GCTCGACAGC TACCTCGCCG GCCTGATCCT GCTCGCCGCC GCGCCGTGCA CGGCGATGGT GTTCGTCTGG AGCCGGCTCA CGGGCGGCGA TCCGCTGTTC ACGCTGTCGC AGGTGGCGCT GAACGACAGC ATCATGGTGA TCGCCTTCGC GCCGCTCGTA GGGCTGCTGC TCGGGATGTC CGCGATCACG GTGCCGTGGG CGACGCTGCT CACGTCGGTC GTGCTCTACA TCGTCATCCC GGTGATCCTC GCGCAACTCT GGCGCAAGCG ACTGCTCGCG AACGGACAGG CGGCGCTCGA CGCCGCGATG GCGAAGATCG GCCCCTGGTC CATCGCCGCG CTGCTCGCCA CGCTCGTGCT GCTGTTCGCG TTCCAAGGCG AGGCAATCCT CGCGCAACCG CTCGTGATCG CGCTGCTCGC CGTTCCGATC CTGATTCAGG TGTTCTTCAA TTCGGCGCTC GCGTACTGGC TGAACCGCGC GGTCGGCGAG AAGCACGACA TCGCGTGCCC GTCGGCGCTC ATCGGCGCCT CCAACTTCTT CGAGCTGGCG GTCGCCGCCG CGATCAGCCT GTTCGGTTTC CACTCGGGCG CGGCGCTGGC GACGGTGGTC GGCGTGCTGA TCGAAGTACC CGTGATGCTG CTGGTGGTGC GCATCGTCAA CCGGACCAAG GGCTGGTACG AGCGGACTTG A
|
Protein sequence | MNPSNVAPPE NAAAQPAINF FERYLTVWVA LCIVAGIALG QARPDLFRQI GRMEYAQVNL PVGLLIWVMI IPMLVKVDFG ALHEVRRHVK GIGVTLVVNW LVKPFSMAFL GWLFIKQFFA PMLPAAQLDS YLAGLILLAA APCTAMVFVW SRLTGGDPLF TLSQVALNDS IMVIAFAPLV GLLLGMSAIT VPWATLLTSV VLYIVIPVIL AQLWRKRLLA NGQAALDAAM AKIGPWSIAA LLATLVLLFA FQGEAILAQP LVIALLAVPI LIQVFFNSAL AYWLNRAVGE KHDIACPSAL IGASNFFELA VAAAISLFGF HSGAALATVV GVLIEVPVML LVVRIVNRTK GWYERT
|
| |