Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1822 |
Symbol | |
ID | 5712813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1900181 |
End bp | 1901206 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267745 |
Product | putative arsenic resistance protein |
Protein accession | YP_001533165 |
Protein GI | 159044371 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.932603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCGGT TCGAGCGGTA TCTGTCGGTC TGGGTGGCGC TGGCCATCGG CGCGGGGCTG GTTTTGGGCA GCGTGGTGCC GGGGGTGTTC GCGGGGCTTG CGGCGCTGGA GGTGGCGTCG GTCAACCTGC CCGTGGCGGT GCTGATCTGG GCGATGGTGT TCCCGATGAT GGTGGGCGTG GATTTCGGCG CGCTCGGGGG CGTGGCGACC CGGCCCAAGG GGCTGGTGAT CACGCTGGTG GTGAACTGGC TGGTCAAGCC GTTCACCATG GCCGCGTTGG GCGTGCTGTT TTTCAACCAC GTCTTCGCGC CCTTCATCCC GCCGGAGGAT GCTCAGATGT ACCTGGCGGG CGTGATCCTG CTGGGGGCCG CGCCCTGCAC GGCGATGGTG TTCGTGTGGT CGAACATGAC CAAGGGCGAT CCGAATTACA CGCTGGTGCA GGTGAGCGTG AACGACGTGG TGCTGGTCTT TGCCTTCGCG CCCATCGTGG CGCTCTTGCT GGGGGTGACG GATATCGTGG TGCCGTGGGA GACGCTGCTG TTGTCCGTGG CGCTTTACGT GGTGATCCCG CTGGCCGTGG GCGTGGTGGT GCGCAAACGG CTGCTGGACG GCGGAGGGGC GGAGGCGCTG GCCGCGTTCC AGGCGCGGGT AAAGCCCGCC TCGGTAGCGG GGCTTTTGCT GACCGTGGTG CTCCTGTTCG GGTTCCAGGG GCGCGTGATC CTGGAGACGC CTTTGGTGAT CGCGATGATC GCCGTGCCGC TGCTGATCCA GAGCTACGGG GTGTTCTTCG TGGCCTATTA CGCCGCCAAG GCCTGGCGCG TGCCGCACGA GGTCGCCGCC CCCTGCGCCC TGATCGGGAC GTCGAATTTC TTCGAGCTGG CCGTGGCCGT GGCGATCAGC GTCTTCGGCC TCGGCTCCGG CGCGGCGCTG GCCACGGTGG TCGGCGTGCT GGTGGAGGTG CCGGTGATGC TGTCGCTGGT GGCGTTTGCC AACCGGACGC GGGGGTGGTT TCCTAAGTCC CCGTGA
|
Protein sequence | MGRFERYLSV WVALAIGAGL VLGSVVPGVF AGLAALEVAS VNLPVAVLIW AMVFPMMVGV DFGALGGVAT RPKGLVITLV VNWLVKPFTM AALGVLFFNH VFAPFIPPED AQMYLAGVIL LGAAPCTAMV FVWSNMTKGD PNYTLVQVSV NDVVLVFAFA PIVALLLGVT DIVVPWETLL LSVALYVVIP LAVGVVVRKR LLDGGGAEAL AAFQARVKPA SVAGLLLTVV LLFGFQGRVI LETPLVIAMI AVPLLIQSYG VFFVAYYAAK AWRVPHEVAA PCALIGTSNF FELAVAVAIS VFGLGSGAAL ATVVGVLVEV PVMLSLVAFA NRTRGWFPKS P
|
| |