Gene Information |
Locus tag | Mext_0212 |
Symbol | |
ID | 5832302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 231193 |
End bp | 232200 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641365997 |
Product | C4-dicarboxylate transporter/malic acid transport protein |
Protein accession | YP_001637709 |
Protein GI | 163849666 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1275] Tellurite resistance protein and related permeases |
TIGRFAM ID | [TIGR00816] C4-dicarboxylate transporter/malic acid transport protein |
|
|
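The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it matches the listed gene length) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 231193
end_bp = 232200
protein_length_aa = 335

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1008
print(remainder)                # 0
print(expected_protein_length)  # 335
```

Under these assumptions the 1008 bp gene corresponds to 336 codons, which agrees with the 335 aa protein length listed in the record.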
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.187317 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCTC CCCTGGCCCC GGCGGCGACG GAGGCGCCCG CCGTCCCGGT GCCGCGGTTC GATTACCTTC CGGTCGCCCT GTTCGGCTCC GTGATGGGGC TGACGGGCCT GAGCGTCGCG TGGCGCCTCG CGGCCGCGCG GTACGGCCTG CCCCCCCTCG TCGCCGATGC GATCGGCTGG AGCGCCGTCC TCACCTTCCT GGCCCTCGCG CTGGCCTACG GCGCGAAGGC GGTCACGGCC TGGCCGGCGG TGACCGCCGA GTTCCGCCAC CCGATCGCCG GCAACCTGTT CGGCACCGTC CTGATCAGCC TGCTGCTCCT GCCCTTCGTG CTGCACGACC TCAGCCCGCC GCTCGCGGCG GCGGCCTGGA TCGTCGGCGC GGGCGGCATG GCCCTGTTTG CCGTCCTCAT CGTCAGCCGC TGGATGGGCA GTCGCCAGCA GCTCGCCCAC GCGACGCCCG CCTGGATGGT GCCGGTGGTC GGCCTTCTCG ACATCCCGCT CGCCGCGCCC GCACTGGGCC TGCCCCACAC GCAGACGCTT GCGATGGCCG CGCTGAGCAT CGGCTTGTTC TTCGCCGGCC CGCTCTTCAC CCTGGTCTTC GCCCGCCTCG TCTTCGAGGA GCCGCTGCCG CCGGCCCAAC GGCCGACGCT GATGATCCTG GTCGCGCCCT TCGCCGTCGG CTTCTCCAGC TACACGGCGA CCTTCGGGCG GATCGACGCC TTCGCCGAGG CGCTGTTCTT GGTTGGCCTG TTCATGTTCG TGGTCCTCCT CGGCCGTCTG CGCGACCTGC CGCGCTGCTG CCCATTCCGC GTCTCGTGGT GGGCGGTGAG CTTCCCGCTC GCTGCCATGG CGGTGGCGGC CCTGCGCTAC GCCGAGCACG TCCGCGCGGT CCCCGCCGAC GTCCTGGCGC TGGCGCTGCT CGCGCTGGCC ACGGCCGGCA TCGCCGCGCT CGCAACCCGA ACCCTCCTCG GCATCGCCCG CGGTGAGCTG CGCCGGCTCG CCGGCTGA
|
Protein sequence | MAAPLAPAAT EAPAVPVPRF DYLPVALFGS VMGLTGLSVA WRLAAARYGL PPLVADAIGW SAVLTFLALA LAYGAKAVTA WPAVTAEFRH PIAGNLFGTV LISLLLLPFV LHDLSPPLAA AAWIVGAGGM ALFAVLIVSR WMGSRQQLAH ATPAWMVPVV GLLDIPLAAP ALGLPHTQTL AMAALSIGLF FAGPLFTLVF ARLVFEEPLP PAQRPTLMIL VAPFAVGFSS YTATFGRIDA FAEALFLVGL FMFVVLLGRL RDLPRCCPFR VSWWAVSFPL AAMAVAALRY AEHVRAVPAD VLALALLALA TAGIAALATR TLLGIARGEL RRLAG
|
| |
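As a rough illustration of how the gene sequence relates to the protein sequence, the following sketch translates DNA using the amino acid assignments of translation table 11 and computes GC content. It uses only the Python standard library. The function names (`clean`, `translate`, `gc_percent`) are hypothetical helpers, not part of any database tool, and only the first 30 bases of the gene are used in the example. The blocks of 10 separated by spaces in the record are treated as display formatting and stripped.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 assigns
# the same amino acids as the standard code; it differs in allowed start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spacing used for display and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record above.
gene_prefix = "ATGGCCGCTC CCCTGGCCCC GGCGGCGACG"
print(translate(gene_prefix))  # MAAPLAPAAT
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.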