Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_2882 |
Symbol | |
ID | 5695740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 3467409 |
End bp | 3468599 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641265497 |
Product | arsenical-resistance protein |
Protein accession | YP_001530762 |
Protein GI | 158522892 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAA AAGCCGACGA CCGTAAGATG ACAAGCCTGT TTGAGCGCTA TCTGACCGTC TGGGTGTTCT TGTGCATTCT TGCCGGCATC GTGCTGGGAA AGGCGGCGCC GGGCGTGGCC CGTTACCTGG ACGGGCTGGC CATTTATGTA AACGGCGCGC CGGTGGTGTC GATTCCCATT GCCGTCTGCC TCTTTTTCAT GATGTACCCC ATCATGGTAA AGATCGATTT TGCCTCGGTG GTCCGGGCCG GCAAAAGCGG CAAACCGGTG TTTCTGACCC TGTTTGTCAA CTGGTGCGTC AAGCCCTTTA CCATGTATGC CATTGCCTCC TTTTTCCTGG GCACCCTGTT TTACAACTTT ATCGGCCCGG ATGCCGTGGA CCTGGTCAAG ATGCCCTTCG GCCTGGACCT GCCCGTGGGC GCGGCCCACG GCGCCGGCAC GGTGGTGATG GTCGACGGCG TCAAGATGAT GGAAGTTCCC CTCTGGCGCA GCTACCTGGC CGGCTGTATC CTGCTGGGCA TCGCCCCCTG CACGGCCATG GTGCTGGTGT GGGGATTTCT TTCCAAAGGC AACGACGGCC TCACCCTGGT GATGGTGGCC ATCAACTCCC TTACCATGCT GGTGCTTTAC GGCGTGCTGG GCGGCTTTCT GCTGGGTGTG GGAAAGCTGC CCGTGCCCTG GCAGGCGCTG CTGCTGTCCA TCGGCATCTA CGTGGCCCTG CCCCTGGTGG CCGGCTATTT TTCCCGCCGA TGGGTCATAT CCGCCAAAGG CGAAACCTGG TTTCAGGAAA AATTTCTGCA TGTGCTCACC CCTGTCACCA TCACGGCCCT GCTGGCCACC CTGGTGCTGC TCTTCTCCTT TAAAGGCGAT GTCATTGTTG AAAACCCCCT GACCATTGTC TGGATCGCGG TTCCGCTGTT TATTCAGACC GTGCTGATTT TTGCCCTGGG TTACGGCCTG GCCCGGCTGT TTAAGCTGAC TTACGAGGAC GCGGCGCCAG CGGCCATGAT CGGGGCCTCC AACCATTTCG AGGTGGCCAT TGCCACGGCC ACCATGCTGT TCGGCCTTTC GTCCGGCGCG GCCCTGGCCA CGGTGGTGGG CGTGCTGATC GAGGTACCGG TGATGCTCAT GCTGGTAAAA ATCTGCCTGC GGACCCGGCA CTGGTTTGAT ACCCAAAACC AAAAAGGGTA G
|
Protein sequence | MNAKADDRKM TSLFERYLTV WVFLCILAGI VLGKAAPGVA RYLDGLAIYV NGAPVVSIPI AVCLFFMMYP IMVKIDFASV VRAGKSGKPV FLTLFVNWCV KPFTMYAIAS FFLGTLFYNF IGPDAVDLVK MPFGLDLPVG AAHGAGTVVM VDGVKMMEVP LWRSYLAGCI LLGIAPCTAM VLVWGFLSKG NDGLTLVMVA INSLTMLVLY GVLGGFLLGV GKLPVPWQAL LLSIGIYVAL PLVAGYFSRR WVISAKGETW FQEKFLHVLT PVTITALLAT LVLLFSFKGD VIVENPLTIV WIAVPLFIQT VLIFALGYGL ARLFKLTYED AAPAAMIGAS NHFEVAIATA TMLFGLSSGA ALATVVGVLI EVPVMLMLVK ICLRTRHWFD TQNQKG
|
| |