Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_0520 |
Symbol | |
ID | 6129261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 618052 |
End bp | 619107 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641640842 |
Product | arsenical-resistance protein |
Protein accession | YP_001767517 |
Protein GI | 170738862 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00313463 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGGCACCT TCGAACGCTA CCTGACCCTC TGGGTCGCCC TCTGCATCGT CGTCGGCGTC GCGCTCGGCC ACGCCGTGCC GGGCGTCTTC CGCGCCGTCG GCGCGGCCGA GATCGCCAAG GTCAACCTGC CGGTGGCGGG GCTGATCTGG CTCATGGTCG TGCCCATGCT GCTCAAGATC GACTTCGCCG CGCTGCGCCA CGTCGGGCGG CACTGGCGCG GCATCGGCGT GACGCTCCTG GTCAACTGGG CCGTGAAGCC GTTCTCGATG GCGGCGCTGG GCTGGCTGTT CATCGGCCAC CTGTTCCGGC CGCTGCTGCC GGCCGACCAG ATCGACGGCT ACGTCGCCGG GCTGATCATC CTGGCCGCGG CGCCCTGCAC CGCGATGGTG TTCGTGTGGT CGAACCTCAC GCGGGGCGAG CCGCACTTCA CGCTGAGCCA GGTGGCGCTC AACGACGGCA TCATGGTGGT GGCCTTCGCG CCCATCGTCG GGCTGCTGCT CGGGCTCTCG GCCATCACGG TCCCGTGGGG CACGCTGGTC CTCTCGGTCG TGCTCTACAT CGTCATCCCG GTCGTGGTCG CGCAGGCGGC CCGGCGCAGC CTGCTCGCCG CGGGCGGCCA GCCCGCCCTC GACCGCCTCC TGGCGCGGCT GGGGCCGGCC TCGCTGGCGG CGCTGCTGGC GACCCTCGTC CTGCTGTTCG GCTTCCAGGG CGAGCAGATC CTGGCCCAGC CGGCCGTGAT CGCGCTCCTG GCGGTCCCCA TCCTGATCCA GGTCTACCTG AACAGCGGGC TCGCCTACCT CCTGAACCGG CTCGCCGGGG AGCAGCACTG CGTCGCCGGC CCCTCCGCGC TGATCGGGGC CTCCAACTTC TTCGAACTCG CGGTGGCGGC GGCCATCTCG CTGTTCGGGT TCGAGTCGGG CGCGGCGCTC GCCACGGTGG TGGGCGTGCT GATCGAGGTC CCGGTCATGC TCAGCGTGGT CCGGATCGTG AACCGCTCCA AGGACTGGTA CGAGCGCGGG GCGGCGCGGG CGCGGCTCGC GCCCCAGGAG GGGTGA
|
Protein sequence | MGTFERYLTL WVALCIVVGV ALGHAVPGVF RAVGAAEIAK VNLPVAGLIW LMVVPMLLKI DFAALRHVGR HWRGIGVTLL VNWAVKPFSM AALGWLFIGH LFRPLLPADQ IDGYVAGLII LAAAPCTAMV FVWSNLTRGE PHFTLSQVAL NDGIMVVAFA PIVGLLLGLS AITVPWGTLV LSVVLYIVIP VVVAQAARRS LLAAGGQPAL DRLLARLGPA SLAALLATLV LLFGFQGEQI LAQPAVIALL AVPILIQVYL NSGLAYLLNR LAGEQHCVAG PSALIGASNF FELAVAAAIS LFGFESGAAL ATVVGVLIEV PVMLSVVRIV NRSKDWYERG AARARLAPQE G
|
| |