Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2042 |
Symbol | |
ID | 5835704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2277714 |
End bp | 2279162 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641367840 |
Product | homospermidine synthase |
Protein accession | YP_001639509 |
Protein GI | 163851466 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG5310] Homospermidine synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.418846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGATG GCCCGAAGAA GGATTGGCCC GTCCACGGTC GCATCACCGG CCCGATCGTG ATGATCGGTT TCGGCTCGAT CGGCCGGGGC ACCCTGCCCC TCATCGAGCG CCATTTCGAG TACGAAAAGA GCCGCTTCAC GGTCATTGAT CCCGTCGACA CCCATAAGGA TCTCGCCGAC AAGCACGGCC TGCGCTTCGA GAAGGTCGCG CTCACCAAGG AAAACTACCG CGACGTCCTC ACCCCGCTCC TCACCGAGGG CGGCGGCCAG GGCTTCTGTG TGAACCTGTC GGTGGACACC TCCTCGCGCG ACATCCTCGA ACTCTGCCGT GAACTCGGCG CGCTCTACAT CGACACGGTC GCCGAGCCCT GGACCGGCTT CTACTTCGAC AAGGACCTGA GCCAGGCCGA CCGCACCAAC TACGCGCTGC GCGAGAACAT CCTCGCCGCC CGCCGCGCCG CTCCCGGCGG CACGACCGCG GTGTCGTGCT GCGGTGCAAA CCCTGGCATG GTCTCGTGGT TCGTCAAGCA GGCGCTGCTC AACGTCGCCC AGGACACCGG CTCCTCGACC CCTGAGCCGA AGACCCGCGA GGAATGGGCC GCGCTCATGC GCGAACTCGG CGTCAAGGGC GTCCACATCG CCGAGCGCGA CACCCAGCGC GCCAAGAACC CCAAGCCCCA GGGTGTGTTC GTCAACACGT GGTCGGTCGA GGGCTTCGTC TCCGAGGGCA ACCAGCCGGC CGAACTCGGC TGGGGCACGC ACGAGACCTG GAAGCCCGCC AATGCCCAGG AGCAGACCAA GGGCTCGCGC TGCGCGATCT TCCTGCTGCA GCCCGGCGCC GACACCCGCG TGCGCTCCTG GACGCCGACC GCGCAGGCGC AGTTCGGCTT CCTCGTGACC CACAACGAGG CGATCTCGAT CGCGGATTAC TACACGGTCC GCGAGGGTAA CGAGGCGGTC TATCGCCCGA CCTGCCACTA CGCCTACCAC CCGGCCAACG ACGCCGTGCT CTCGCTGCAC GAGATGTGGG GCAATGCCGG CAAGGTGCAG GAGCACCAGC ACATCCTCGA CGAGAACGAG ATCGTCGACG GCATCGACGA ACTCGGCGTC CTCCTCTACG GGCATAAAAA GAACGCCTAC TGGTACGGCT CGCAGCTCTC CATCGAGGAG ACCCGGCGGA TCGCCCCCTA CCAGAACGCG ACCGGCCTTC AGGTGACGAG CGCCGTGCTC GCCGGCATGG TCTGGGCGCT GGAGAACCCG GAGGCCGGCA TCGTCGAGGC CGACGAGATC GACTTCCGCC GCTGCCTGGA AGTGCAGACG CCGTATCTCG GTCCGGTCGT GGGCGTTTAC ACCGACTGGA CCCCGCTCAC CGACCGTCCG GGCCTGTTCC CGGAGGACAT CGACCCGAGC GACCCCTGGC AGTTCCGCAA CGTGCTCGTG CACGGCTGA
|
Protein sequence | MSDGPKKDWP VHGRITGPIV MIGFGSIGRG TLPLIERHFE YEKSRFTVID PVDTHKDLAD KHGLRFEKVA LTKENYRDVL TPLLTEGGGQ GFCVNLSVDT SSRDILELCR ELGALYIDTV AEPWTGFYFD KDLSQADRTN YALRENILAA RRAAPGGTTA VSCCGANPGM VSWFVKQALL NVAQDTGSST PEPKTREEWA ALMRELGVKG VHIAERDTQR AKNPKPQGVF VNTWSVEGFV SEGNQPAELG WGTHETWKPA NAQEQTKGSR CAIFLLQPGA DTRVRSWTPT AQAQFGFLVT HNEAISIADY YTVREGNEAV YRPTCHYAYH PANDAVLSLH EMWGNAGKVQ EHQHILDENE IVDGIDELGV LLYGHKKNAY WYGSQLSIEE TRRIAPYQNA TGLQVTSAVL AGMVWALENP EAGIVEADEI DFRRCLEVQT PYLGPVVGVY TDWTPLTDRP GLFPEDIDPS DPWQFRNVLV HG
|
| |