Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2343 |
Symbol | arsC2 |
ID | 4784563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2513072 |
End bp | 2513902 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640090912 |
Product | arsenate reductase |
Protein accession | YP_001021534 |
Protein GI | 124267530 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1246] N-acetylglutamate synthase and related acetyltransferases [COG1393] Arsenate reductase and related proteins, glutaredoxin family |
TIGRFAM ID | [TIGR00014] arsenate reductase (glutaredoxin) [TIGR01617] transcriptional regulator, Spx/MgsR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACCG TCACGATCTA CCACAACCCG GACTGCGGCA CGTCGCGCAA CACGCTGGCG CTGATCCGCG CCAGCGGCAT CGAGCCCACG GTCATCGAGT ACCTGAAAAC GCCGCCCGAC CGCGAGACCT TGAAGGCGCT GATCGCGCGG ATGGGCATGG GCGTGCGGGA TGTGTTGCGC ATCAAGGGCA CGCCCTACAA GGAACTGGGC CTGGATGCCG CGCATTGGAG CGACGACCAG TTGATCGATC AGATGCTGGC GTACCCGATC CTCATCAATC GGCCCATCGT GGTGTCGCGC TCGGGCGTGC GCCTGTGCCG CCCCTCGGAC ACGGTGATCG ACCTGCTGCC GCAACGACCG GCGGGCGAGA ACCGCAAGGA GGATGGCACG CCGCTGCTGG TGGATACGCC GATCGCCGGC AGCGACCCGG ACCTCGCCCT GGCTTTGCAA GAGGCCGTGC TTCCCACGGA CGACCTGGCC GAACCAGGGC GCAGCTTCTT CGCCTATGCC ACGGTGTCCG GCGAGCGCGT GGGCTACGGC GGCTTCGAGC GCCTGGGCCG AGACGTGCTC GTGCGCTCCC TGGTGGTGCT GCCACACGCA CGCCATCGCG GCATCGGCGG CGGAATGTTC GCACTGCTGC TGCGCCGTGC CTTCGACGAG GGCGGGCGCG ACGCCTGGCT ACTGACCACG ACGGCGGCGC CGTTCTTCGA GCGCGCGGGC TTCAAGCCAA TCGAGCGCAG CGCCGCACCG GCGGCCATCC TTGCCACACG GCAGGCCGCA AGCCTTTGTC CTTCGAGCGC CGTGCTGCTT GGACGTCGAA TGTCACTGTG A
|
Protein sequence | MSTVTIYHNP DCGTSRNTLA LIRASGIEPT VIEYLKTPPD RETLKALIAR MGMGVRDVLR IKGTPYKELG LDAAHWSDDQ LIDQMLAYPI LINRPIVVSR SGVRLCRPSD TVIDLLPQRP AGENRKEDGT PLLVDTPIAG SDPDLALALQ EAVLPTDDLA EPGRSFFAYA TVSGERVGYG GFERLGRDVL VRSLVVLPHA RHRGIGGGMF ALLLRRAFDE GGRDAWLLTT TAAPFFERAG FKPIERSAAP AAILATRQAA SLCPSSAVLL GRRMSL
|
| |