Gene Rmet_3991 details


Gene Information

Locus tag: Rmet_3991
Symbol: arsB1
ID: 4040849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 559311
End bp: 560570
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 67%
IMG OID: 637979415
Product: arsenite permease
Protein accession: YP_586128
Protein GI: 94312919
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases
TIGRFAM ID:
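
The start, end, gene length and protein length listed above are consistent with one another. A minimal Python sketch of that arithmetic follows. It assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; neither is stated on this page. The helper names `gene_length` and `protein_length` are hypothetical, introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Rmet_3991.
length = gene_length(559311, 560570)
print(length, protein_length(length))  # prints: 1260 419
```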


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0830422
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.159952
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCCT ACGCGACACC GCTGATCTGG AGCGTCGCGG CACTATCCAC GGCGGGCGTG 
CTGTTCCGGC CCTTTCGCCT GCCCGAGCCG TTCTGGGCAA TGGCTGGCGC GCTGGTGCTA
TGCGTGGCAG GACTGCTGCC ATGGCGCGAC GCCCTGCAAG CCGTGGCACG CGGCAACGAC
GTCTATCTGT TCCTGGCCGG GATGATCCTG ATCTCGGAAC TCGCCCGCAA GACAGGTCTG
TTCGATCATG TGGCCGCGCT GGCCGTGCGC GCCGCGCGGG GGTCGGCGCG CAAGCTATTT
GCGCTGGTCT ATGGCTTCGG CATTGCGGTG ACAGCGTTCA TGTCGAACGA TGCCACGGCA
GTCGTCCTCA CCCCCGCGGT CATCGCCGCG ACACGCGCGG CACGGGTCAA GCACCCGTTA
CCCTATCTCT ACGCCTGCGC GTTCATCGCC AATGCGGCGA GCTTTCTCCT GCCGATCTCT
AATCCGGCCA ACCTCGTGCT GTTCGGTGAC CGCATGCCAC CTCTGACCAG TTGGCTGGCA
CGCTTCACGC TGCCGTCGGT GGTGGCCATC GCCATGACGT TCATCGTCCT GTACTGGACG
CAACGCGATG CACTGGCCGA GCCGATTGAG AACGACGTGC CGACGCCCCC TCTGACGCTC
CAGGCCTGGC TGACGACGCT GGGCATCATG CTGACCGGAG CGGCACTGTT GACGGCCTCG
CTGCACGGGC AGGATCTCGG CTGGCCGACG TTCATCGGTG GTCTGTTGAC CCTGGCTGTC
GTCTGCGCCA CCCAGCCGCG ACTGCTTGTG CCGGCGCTCA AGGAGGTGTC CTGGGGCGTA
TTGCCGTTGG TGGCCGGACT GTTCGTCCTG GTTGCCGGCC TGGCCCAGAC CGGCTTGACC
GCTCAGCTTG CACATTGGGT GCGGATGCTA TCCGGGCTGC AAGGGCCGGA GGCCGTGCTT
GGCGCCGGTG TGGCGGGCGT GCTCGTCGGC ATCACGAGCA ACATCGTCAA CAACCTGCCG
GCCGGACTGT TCGCAGCCTC GGCGCTGGCG GCAGGCCACG CCTCTGATAC CGTCACGGCT
GCCGTGCTGA TCGGTGTGGA CCTGGGCCCG AACCTGTCCA TTACCGGCTC GCTGGCCACC
CTGCTCTGGC TGACCGCCCT GCGCCGTGAA GGTCATATGG TCGGCGCCGG CACCTTCCTG
AAGACCGGTG CGCTCGTCAT GCCGCTGGCA CTACTCCCGG CCCTGGCGGT ACTGCGCTGA
 
Protein sequence
MPAYATPLIW SVAALSTAGV LFRPFRLPEP FWAMAGALVL CVAGLLPWRD ALQAVARGND 
VYLFLAGMIL ISELARKTGL FDHVAALAVR AARGSARKLF ALVYGFGIAV TAFMSNDATA
VVLTPAVIAA TRAARVKHPL PYLYACAFIA NAASFLLPIS NPANLVLFGD RMPPLTSWLA
RFTLPSVVAI AMTFIVLYWT QRDALAEPIE NDVPTPPLTL QAWLTTLGIM LTGAALLTAS
LHGQDLGWPT FIGGLLTLAV VCATQPRLLV PALKEVSWGV LPLVAGLFVL VAGLAQTGLT
AQLAHWVRML SGLQGPEAVL GAGVAGVLVG ITSNIVNNLP AGLFAASALA AGHASDTVTA
AVLIGVDLGP NLSITGSLAT LLWLTALRRE GHMVGAGTFL KTGALVMPLA LLPALAVLR
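
The sequences above are printed in space-separated blocks of 10 characters, 6 blocks per line. Before any downstream use, that whitespace usually needs to be removed. The Python sketch below shows one way to do this and to compute a GC fraction from the cleaned string. The function names `normalize` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def normalize(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed on this page.
first_line = "ATGCCCGCCT ACGCGACACC GCTGATCTGG AGCGTCGCGG CACTATCCAC GGCGGGCGTG"
seq = normalize(first_line)
print(len(seq))  # prints: 60
print(f"{gc_fraction(seq):.2f}")
```

Applying `normalize` to the full gene sequence and checking that the result is 1260 characters long is a quick test that nothing was lost when copying.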