Gene Information |
Locus tag | Mext_3966 |
Symbol | |
ID | 5835623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4406632 |
End bp | 4407900 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641369757 |
Product | arsenical pump membrane protein |
Protein accession | YP_001641408 |
Protein GI | 163853365 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases |
TIGRFAM ID | |
|
|
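The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4406632
end_bp = 4407900

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue to the protein.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1269
print(leftover_bases)      # 0
print(protein_length_aa)   # 422
```

Under these assumptions the computed values agree with the reported Gene Length (1269 bp) and Protein Length (422 aa).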
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.317509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCGC TGATCCCGAA CCCGAACGCG GCGACCTGGG GTATCGCCGC GCTGGCGACG CTCGGCGTGA TCCTGCGCCC GTTCTCCTGG CCGGAGGCGA TCTGGGCGGT GCTCGGCTCG GTGCTCCTCG TCCTCCTCGG CCTGATCCCC TGGCAGAATG CCCTGGAGGG CGCGGCCAAG GGCACCGACG TCTATCTCTT CCTCGTGGGG ATGATGCTGC TCTCCGAGAT CGCCCGGAAG CAGGGCTTGT TCGATTGGCT CGCCGCCCAC GCGGTGCGGG CCGCGAAGGG GTCGCCGACG CGGCTGTTCT CGCTCGTCTA CGTCGTCGGC ACGGTGGTCA CGGTCTTCCT CTCGAACGAT GCCTGCGCGG TGGTGCTGAC GCCCGCCGTC TTCGCCGCGA CGCGGGCCGC CGGGGTGAAG CAGCCCCTGC CCTACCTGTT CGTCTGCGCC TTTATCGCCA ACGCGGCGAG CTTCGTGCTG CCGATCTCGA ACCCGGCCAA CCTCGTCGTC TTCGCCGAGC ACATGCCGCC GCTCGGCCGA TGGCTGGCGA CCTTCACCCT GCCCTCCCTC CTCGCCATCG TCGCGACCTA TCTCGTCCTG CGCCTGACCC AGAACGCGCG GCTGAAGGCC GAGACGGTCG CGACCGACGT CGCGATCCCG AGGCTCGGGC TCGGCGGCAC GATCGCGGCC GGGGGCATCG TCGCCACCGG CGCGGCCCTG ATCGGCGCCT CGGCCGCCGG GATCGAACTC GGCCTGCCGA CCTTCATCGC CGGGCTCGCC ACGACCCTCG TCGTGCTCGC AATCAACCGG GGCGGGCTGG TCGCGGTCGC TCGGGACGTC TCCTGGGGCG TGCTGCCGCT GGTCGCCGGG CTCTTCGTCC TCGTCGAGTC CCTGGAGAAA ACCGGCCTGC TCGCGAAACT CGCCGACCTC CTGGGCCGCG CCGCGCAGGG CGATCCCGCC GCGACGGCCT GGGCCGGCGG CGCGCTCGTC GCCTTCGGAT CGAACCTCGT GAACAACCTG CCGGCGGGTC TCCTGGCGGG CGCGGCGGTG CAGGCCGCCC ATGTGCCGGA GACGGTGGCG GGGGCGATCC TGATCGGCGT CGATCTCGGG CCGAACCTCT CGGTCACGGG CTCGCTCGCC ACGATCCTCT GGCTCACCGC GATCCGCCGC GAGGGCCAAA ACGTCTCCGC CTGGGCTTTC CTGAAGCTCG GCGCCCTGGT CATGCCCCCG GCGCTGGCGC TGGCCCTCGC GGCTCTGATC CTCGCCTGA
|
Protein sequence | MGALIPNPNA ATWGIAALAT LGVILRPFSW PEAIWAVLGS VLLVLLGLIP WQNALEGAAK GTDVYLFLVG MMLLSEIARK QGLFDWLAAH AVRAAKGSPT RLFSLVYVVG TVVTVFLSND ACAVVLTPAV FAATRAAGVK QPLPYLFVCA FIANAASFVL PISNPANLVV FAEHMPPLGR WLATFTLPSL LAIVATYLVL RLTQNARLKA ETVATDVAIP RLGLGGTIAA GGIVATGAAL IGASAAGIEL GLPTFIAGLA TTLVVLAINR GGLVAVARDV SWGVLPLVAG LFVLVESLEK TGLLAKLADL LGRAAQGDPA ATAWAGGALV AFGSNLVNNL PAGLLAGAAV QAAHVPETVA GAILIGVDLG PNLSVTGSLA TILWLTAIRR EGQNVSAWAF LKLGALVMPP ALALALAALI LA
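As a rough consistency check between the Gene sequence and Protein sequence fields, the sketch below translates the first three 10-base blocks and the last nine bases of the gene sequence. It assumes the sequence is shown in coding orientation (it starts with ATG and ends with TGA) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, written for illustration only.

```python
from itertools import product

# Codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0; trailing partial codons are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Excerpts copied from the Gene sequence field above.
first_blocks = "ATGGGCGCGC TGATCCCGAA CCCGAACGCG"
last_codons = "CTCGCCTGA"

print(translate(first_blocks))  # MGALIPNPNA
print(translate(last_codons))   # LA*
```

The translated excerpts match the start (MGALIPNPNA) and end (LA) of the listed protein sequence, with the stop codon accounting for the final `*`. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.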