Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3744 |
Symbol | |
ID | 4598606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3963347 |
End bp | 3964438 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639778352 |
Product | arsenical-resistance protein |
Protein accession | YP_924931 |
Protein GI | 119717966 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.171539 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGACA CTGCAGCGGC GCAGGTCCGT GCCGACGTCG TCGGCCGGCT CCCCACGCTC GACCGGTACC TGCCGGTGTG GATCGGGCTG GCGATGGCGG CCGGGCTGCT GCTCGGCCGC TGGGTGCCCG GCATCGCCGA CGTCCTCGAC GCGATCACGA TCGCCTCGGT GTCGCTGCCG ATCGCGCTCG GGCTGCTGGT GATGATGTAC CCCGTCCTCG CCAAGGTCCG CTACAACGAG GTGGGCGACG TCGCTCGCGA CACCCGCATG ATGGCGCTGT CGGTGGTGCT GAACTGGGTC GTCGGTCCGG CCCTGATGTT CACCCTGGCC TGGGTGTTCC TCGCCGACCT GCCCGAGTAC CGCACCGGGC TGATCATCGT GGGCCTGGCC CGGTGCATCG CGATGGTGAT CATCTGGAAC GACCTCGCCT GCGGGGACCG GGAAGCGGCG GCCGTCCTGG TCGCCCTGAA CTCGGTCTTC CAGGTGCTCG CCTTCGCCCT GCTCGGCTGG TTCTACCTCG ACCTGCTGCC CGGCTGGCTC GGCCTCTCCG GCACCGGGCT GGAGGTGTCG CCCTGGCAGA TCGCCTGGAG CGTGGTGGTG TTCCTCGGCA TCCCGCTCGC TGCCGGCTAC CTCAGCCGCC GGGCGGGAGA ACGACGCCGC GGCCGCGAGT GGTACGAGCA GCGGTTCCTG CCACGGATCG GGCCGTGGGC GCTGTACGGA CTCCTGTTCA CCATCGTGGT GCTGTTCGCA CTGCAGGGCG ACACCATCAC CAACCAGCCG GCCGACGTCG CGCGCATCGC CGTCCCCCTG GTCGTCTACT TCGCCCTGAT GTGGGGCGGG TCGATGCTGG CCGCCCACCG TGCGGGGCTC GGCTACCGCC GATCCACCAC GGTCGCCTTC ACGGCAGCCG GCAACAACTT CGAGCTCGCG ATCGCGGTGG CCATCGCCGT GTACGGCGTC ACCAGCGGGC AGGCGCTCGC GGGAGTCGTC GGCCCGCTGA TCGAGGTGCC CGTCCTCGTC GGCCTGGTCT ACGTGAGCCT CTGGGCCCGC CGCTTCTTCC CCGACACCGT CCAGGAGGAC CTACCCCGAT GA
|
Protein sequence | MSDTAAAQVR ADVVGRLPTL DRYLPVWIGL AMAAGLLLGR WVPGIADVLD AITIASVSLP IALGLLVMMY PVLAKVRYNE VGDVARDTRM MALSVVLNWV VGPALMFTLA WVFLADLPEY RTGLIIVGLA RCIAMVIIWN DLACGDREAA AVLVALNSVF QVLAFALLGW FYLDLLPGWL GLSGTGLEVS PWQIAWSVVV FLGIPLAAGY LSRRAGERRR GREWYEQRFL PRIGPWALYG LLFTIVVLFA LQGDTITNQP ADVARIAVPL VVYFALMWGG SMLAAHRAGL GYRRSTTVAF TAAGNNFELA IAVAIAVYGV TSGQALAGVV GPLIEVPVLV GLVYVSLWAR RFFPDTVQED LPR
|
| |