Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1525 |
Symbol | |
ID | 5833714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1703654 |
End bp | 1704979 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641367323 |
Product | NUDIX hydrolase |
Protein accession | YP_001638995 |
Protein GI | 163850952 |
COG category | [F] Nucleotide transport and metabolism [S] Function unknown |
COG ID | [COG1051] ADP-ribose pyrophosphatase [COG4923] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.103483 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATGCT GGGTGGCGGG GCTGGATGGG TGCCGTGGGG CCTGGGCGGG CGCCCTGATC GATCTCGACG ATCCGGCACG CTGGCGCTGC GCCCGCTTCC CGCGGGTGAT CGACCTGCTC GACGGGCCGG AGGCTCCGGT CTGCGTCGGC ATCGACGTGC CGATCGGCCT GCCCGACCGG GTGAGCGGGG GACGCTCGGC CGACCGGGCG GCGCGGGCGT TTCTCGGGGC GGGGCGTTCC AGCGTCTTCC CGGTGCCGCC CCGCGCGGCG GTCTATGCCG GGAGCTACGA CGAGGCGAAG GTCCTGTCGC GGGCGCATTC CGAGCCTCCC TTCGCGCCCT CGATCCAGTG CTGGAACATT CTGCGCTACG TGCGGGAAGC CGACGAACTC CTGCGCGCGC GGTCCGATCT CGTGACGCGC CTGCACGAGG TGCATCCGGA GGTCGCCTTC TTCCGCCTCA ATGGCGAGCA GCGTCTCTCA GCCGGCAAGA AGGGCCCGGC CCGTGCCGAG GGTCTTGCCG CGCGCCGGGC CCTCCTGATC GCCGCCGGGC TGCCCGCGGC GATGGTCGGT TCGCCGCCGC CGCCGGGTGT CGCTGCCGAC GACCATCTCG ACGCCATGGC CGCCCTCGTC GTCGCCCGCG ACATCGCCGA GGGCCGCGCC GAGCCGCTTC CCAACGCGAT CGAGTGCGAC AGCTATGGCC TGCCGATCGT GATCTGGGCG CCCGCGCCCC TTCCTGCGCC GCTTCCGATG CCCTTTCCCG CGCCCTTTGA CCGCGCCCCT TGCCACGAGG AGTTGAGCCC CGTGACCGAT CACCCCGACA CGGATCTGCC GACCCGCGAC ATCGCCCGCG CCCTGGTCTT CGATCCTTCG AACCGCCTTC TCCTGATCGA GTACGAGGCG GTGCGCCCGA TCGATCCGGC CGATCCCGAT GCCCGCGGCT TCTGGTTCAT GCCCGGCGGC GGGCTGGAGC CGGGCGAGAG CCACGAGGAG GCCTGCCGGC GCGAACTCTC GGAGGAGATC GGCGTCGCAG ACGTGGAACT CGGACCCTGC GTCGCCGTCT GCGACGGGCC GTTCCACCTG TTCCGCAAGC CGCGTCATGC CCGCGAGCGC TACTTCGTGG TCCGGCTCGC GAGCGACCGC GTCGATACCA GTCGGCTGGC CGAGACCGAG GACAATCCCG TCCGCGGCAC CCGCTGGTGG CCGCTCGATG AACTCGCGGC TTCCGCCGAG CGTGTGGAGC CCGCGGGACT GGCCAAGCTG GCGCAACGAA TCGCCGCTGG TGACGTTCCG GACCAGCCCG TCCGCCTCAC TTGGCGGGAC GCTTGA
|
Protein sequence | MACWVAGLDG CRGAWAGALI DLDDPARWRC ARFPRVIDLL DGPEAPVCVG IDVPIGLPDR VSGGRSADRA ARAFLGAGRS SVFPVPPRAA VYAGSYDEAK VLSRAHSEPP FAPSIQCWNI LRYVREADEL LRARSDLVTR LHEVHPEVAF FRLNGEQRLS AGKKGPARAE GLAARRALLI AAGLPAAMVG SPPPPGVAAD DHLDAMAALV VARDIAEGRA EPLPNAIECD SYGLPIVIWA PAPLPAPLPM PFPAPFDRAP CHEELSPVTD HPDTDLPTRD IARALVFDPS NRLLLIEYEA VRPIDPADPD ARGFWFMPGG GLEPGESHEE ACRRELSEEI GVADVELGPC VAVCDGPFHL FRKPRHARER YFVVRLASDR VDTSRLAETE DNPVRGTRWW PLDELAASAE RVEPAGLAKL AQRIAAGDVP DQPVRLTWRD A
|
| |