Gene Information |
Locus tag | Dfer_0911 |
Symbol | |
ID | 8224480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 1069072 |
End bp | 1070172 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644928775 |
Product | arsenical-resistance protein |
Protein accession | YP_003085329 |
Protein GI | 255034708 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
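The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information rows above.
START_BP = 1069072
END_BP = 1070172


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(f"Gene Length: {gene_len} bp, Protein Length: {prot_len} aa")
```

Under these assumptions the computed values agree with the Gene Length (1101 bp) and Protein Length (366 aa) rows.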
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.28145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.11905 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAAA ACCAAAATTT TATGCAAGAA AAGCGGCTTT CCTTCCTTGA CCGGTATTTG ACCGTCTGGA TCTTCCTGGC CATGCTTTTG GGTACGTCCA TAGGGTATTT TTTCTCGGGC ACACCCGGTT TTATCAATCG GTTTAATGCT GGTTCGACGG CAGCGGCCGG CGCCGGTGCG CTGGGCGTGA ACGTGCCGCT GGCCATCGGG CTGATCCTGA TGATGTACCC GCCGCTTGCC AAAGTGCGCT ATTCGGAACT TGCTCAAATA TTCGGGAATA CGCGCATTCT GGGCCTGTCA CTGCTGCAAA ACTGGATCGT GGGGCCTATC CTGATGTTCG GTCTGGCGGT GATATTCTTG CCCGATAAAC CGGAATACAT GGCCGGATTG ATCATTATTG GAATTGCGCG CTGCATTGCG ATGGTGATCG TCTGGAATGA CCTTGCCGGC GGCGACCGGG TGTATGCGGC CGGGCTGGTC GCATTCAACA GCATATTTCA GGTGTTGTTT TATTCGGTAT ACGCTTATGT GTTCGTGACG GTGTTACCGC CATTGTTTGG TTTGAAAGGG TTTAATGTCA ATATCACCAT CGGAGAAGTT GCGCAAAGCG TCTTCATTTA CCTCGGCATC CCATTCCTGG CAGGAATCAT TTCGAGGGAA GTTTTGACGA GGCTATTCAG CAAGCATTGG TACGAGCAGA AGTTCCTTCC GGCGATCAGC CCGATTACGC TGATTGCCCT GCTATTTACC ATTGTGGTGA TGTTCAGCCT GAAAGGCCAG TTGATTGTGA CGATCCCCAT CGATGTGCTG CGCATTGCGA TCCCGCTCTC AGCCTATTTC GCGATCATGT TCTTTTCCGC CTTTTATCTT TCAAAACGGG CAGGCGCGGG CTACTCAAAG TCTACTTCAC TTGCATTTAC GGCCGCAGGC AACAATTTTG AGCTGGGTAT TGCTGTGGCT ATCGCCGTGT TCGGTGTTGG TTCGGGCGCG GCATTTGCCG CAGTGATCGG CCCGCTGATC GAAGTGCCGG TATTGATCCT GATGGTGAGA TATGCCAAGA GCCAGCGGCA ATCATTTGCG GCTCTGAACA GAAATGTTTA G
|
Protein sequence | MIKNQNFMQE KRLSFLDRYL TVWIFLAMLL GTSIGYFFSG TPGFINRFNA GSTAAAGAGA LGVNVPLAIG LILMMYPPLA KVRYSELAQI FGNTRILGLS LLQNWIVGPI LMFGLAVIFL PDKPEYMAGL IIIGIARCIA MVIVWNDLAG GDRVYAAGLV AFNSIFQVLF YSVYAYVFVT VLPPLFGLKG FNVNITIGEV AQSVFIYLGI PFLAGIISRE VLTRLFSKHW YEQKFLPAIS PITLIALLFT IVVMFSLKGQ LIVTIPIDVL RIAIPLSAYF AIMFFSAFYL SKRAGAGYSK STSLAFTAAG NNFELGIAVA IAVFGVGSGA AFAAVIGPLI EVPVLILMVR YAKSQRQSFA ALNRNV
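The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch and not the tool used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the whitespace stripping assumes the 10-base grouping shown in the Sequence rows.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string and drop a trailing stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein.rstrip("*")


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence listed above.
print(translate("ATGATTAAAA ACCAAAATTT TATGCAAGAA"))  # MIKNQNFMQE
print(translate("GCTCTGAACA GAAATGTTTA G"))           # ALNRNV
```

The two printed fragments match the first and last blocks of the protein sequence row. Passing the full Gene sequence value to `translate` and `gc_fraction` would allow the same comparison against the complete Protein sequence and the GC content row.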