Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3866 |
Symbol | |
ID | 5833900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4292393 |
End bp | 4293469 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641369656 |
Product | arsenical-resistance protein |
Protein accession | YP_001641309 |
Protein GI | 163853266 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.222492 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTGT TCGAACGTTT CCTCACCCTC TGGGTGGCCC TGTGCATCGT GGCCGGCATC GCGCTCGGCC ACGTCATGCC CGGCGTCTTC CACGCTGTCG GCGCCGCCGA GGTCGCCAAG GTGAACCTGC CGGTGGCGGT GCTGATCTGG CTCATGGTCA TCCCCATGCT GCTCAAGATC GACTTCGCCG CGATGCGCAA GGTCGGGCGG CACTGGCGCG GCATCGGCGT GACGCTCTTC ATCAACTGGG CGGTGAAGCC GTTCTCGATG GCGGCGCTGG GCTGGCTGTT CATCGGCACC CTGTTCCGAC CGTACCTGCC CGCAGAGCAG ATCGACAGCT ACATCGCCGG GCTCATCATC CTGGCCGCCG CGCCCTGCAC GGCGATGGTG TTCGTGTGGT CGAACCTGAC CCGCGGCGAG CCGCACTTCA CGCTGAGCCA AGTGGCGCTC AACGACACCA TCATGGTGGT GGCGTTCGCG CCCCTCGTCG GCCTTCTGCT CGGCCTCTCG GCCATCACCG TGCCCTGGGG AACGCTGGTG CTGTCGGTCG TGCTCTACAT CGTCATCCCC GTCGTGATCG CGCAGGCGGT CCGCCGAATT CTGCTGGCGT CCGGCGGCCA AGCCGCCCTC GACCGCCTGC TCGGCCGGCT CGGTCCGGTC TCGCTGGTGG CGTTGCTGGC CACCCTCGTG CTGCTGTTCG GCTTCCAGGG CGAGCAGATC CTCGCCCTGC CGGCGGTCAT CGGTCTGCTC GCGGTTCCGA TCCTCATCCA GGTCTACCTG AACGCGGGGC TGGCCTACCT GCTCAACCGC GCTGCGGGCG AACAGCACTG CGTCGCCGGC CCCTCGGCGC TGATCGGCGC CTCGAACTTC TTCGAGCTCG CGGTGGCGGC CGCCATCAGC CTGTTCGGCT TCAACTCGGG CGCGGCGCTC GCCACCGTGG TCGGCGTCCT CATCGAGGTG CCGGTCATGC TGTCGGTCGT GTGGATCGTG AACCGCTCGA AGGGCTGGTA CGAGCGCGGT GCGTCAGCAC GAGGCACGGC GCTGCAATCC GTTTCCCGCC AGACCCGAGT CGGGTAG
|
Protein sequence | MSLFERFLTL WVALCIVAGI ALGHVMPGVF HAVGAAEVAK VNLPVAVLIW LMVIPMLLKI DFAAMRKVGR HWRGIGVTLF INWAVKPFSM AALGWLFIGT LFRPYLPAEQ IDSYIAGLII LAAAPCTAMV FVWSNLTRGE PHFTLSQVAL NDTIMVVAFA PLVGLLLGLS AITVPWGTLV LSVVLYIVIP VVIAQAVRRI LLASGGQAAL DRLLGRLGPV SLVALLATLV LLFGFQGEQI LALPAVIGLL AVPILIQVYL NAGLAYLLNR AAGEQHCVAG PSALIGASNF FELAVAAAIS LFGFNSGAAL ATVVGVLIEV PVMLSVVWIV NRSKGWYERG ASARGTALQS VSRQTRVG
|
| |