Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2436 |
Symbol | |
ID | 5834010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2739250 |
End bp | 2740155 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641368237 |
Product | CMP/dCMP deaminase zinc-binding |
Protein accession | YP_001639902 |
Protein GI | 163851859 |
COG category | [F] Nucleotide transport and metabolism [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0590] Cytosine/adenosine deaminases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.768998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCTGCA ACAATCCAAA TCACGATCAC AGTCTCACTC GTAAAGGACT GTTGCGGGGG CTATTCGCCG CTGTTGGAGG TCTTGCGGCT GGGGCCGCGG CAACCCGACC CTCCTTAGCC GCAGCCTCCA CGGACGCGAG CGTTTGCTAC TCGTACAATC CGGCGGCAAC CCAGATGTCC GCCATCGCAT GGGGCAAGCC TTACGAGGGT GAGCCTACGT CGCTCAAGGA TCGGCCTCTC TTCGGCTACG TGCAGGAATG GAACAACTTC GATCCGAAGT GCACACTGGA GCCGGAGGCA CTCTGCAAGC AGTTTCCGTC GCACGCGCAC AACATCAAGG CCGCGGGCGT GTACGACGGC ATTGAGGTTG CAGGCAATCA GTGGATGCGC ATGGCCTCTG AGGAGGCTCG CATTTCTGTC GAGAATGGTG GTGGCCCCTT CGGTGCGGTC ATCCTCCAAA TCGATGACGA GACCAACGAA GTCATTCGCT ACTGGCGCAA CCACAATCAC GTCCCGGAAT GGCGCGACCC CACAGCCCAC GCGGAAGTCT CGGCTATACG AGCCGCTTGC CGCGAGCTTG GCGTCTTTAG CCTCGCGAGC ATCAAGAAAG AGGAGTCGAA GTTGCCGCAG AAGGGGGCAA CATCCCACAC AGTTATCTAT TCGTCGGCCG AGCCCTGTCC GATGTGCTAC GCGGCCATCT ATTGGGCGCG TATCCCTAAG CTCGTGTTTG CGGCCACCCG CTACGATGCT GCCGTTCAGG GTGTCGAATT TTCAGACGAG ACACTCTATC TCGAACTGGC TCAGCCCTAT CGAGATAGAA AAGGGGTGAA ATCACTCCAG GCCAGTGTCG ACAACTCGCT AGATGCGTTC AATCTTTGGA AGCGCAGCAA GAAGACTCCC TATTGA
|
Protein sequence | MTCNNPNHDH SLTRKGLLRG LFAAVGGLAA GAAATRPSLA AASTDASVCY SYNPAATQMS AIAWGKPYEG EPTSLKDRPL FGYVQEWNNF DPKCTLEPEA LCKQFPSHAH NIKAAGVYDG IEVAGNQWMR MASEEARISV ENGGGPFGAV ILQIDDETNE VIRYWRNHNH VPEWRDPTAH AEVSAIRAAC RELGVFSLAS IKKEESKLPQ KGATSHTVIY SSAEPCPMCY AAIYWARIPK LVFAATRYDA AVQGVEFSDE TLYLELAQPY RDRKGVKSLQ ASVDNSLDAF NLWKRSKKTP Y
|
| |