Gene Information |
Locus tag | Mext_1157 |
Symbol | |
ID | 5833939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1267660 |
End bp | 1268769 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641366950 |
Product | Nitrilase |
Protein accession | YP_001638630 |
Protein GI | 163850587 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | |
|
|
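The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Mext_1157.
length_bp = gene_length(1267660, 1268769)
print(length_bp)                  # 1110
print(protein_length(length_bp))  # 369
```

Under these assumptions the computed values agree with the listed Gene Length (1110 bp) and Protein Length (369 aa).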
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.932259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.338424 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTCC AATACCCGAA GTTCAAGGCG GCCGCCTGCC ACGTCGCCTC GGTCTTCCTC GACAGCACCG CCTCCGCTGA GAAGGCCGTT GCGCTGATCG GCGAGGCCGC CCGCGCGGGC GCCGACCTCG TGGTCTTTCC CGAGGGCTAC ATGCCGGGCT TCCCGCTCTG GGCGGCTCTG CGGGCGCCGA TCCACAACCA CGATCTGTTC AAGCGCCTCG CGGCCCAGTC CGTGCGCCTG GACGGCCCGG AGATCGGCGC GGTCCGCGCC GCCGCCCGGC GCCACGGCGT GCTCGTCTCG CTCGGGTTCA GCGAGAGCAC CGAGGCCAGC GTCGGCTGCC TGTGGAACGC GAACGTGCTG ATCGGGCGCG ACGGGGCGAT CCTCAACCAC CACCGCAAGC TCGTGCCGAC CTTCTACGAA AAGCTCATCT GGGCGAACGG CGACGCCCGG GGCTTGCGCG TGACGCGCAC CGAGATCGGC CGCGTCGGCA TGCTGATCTG CGGTGAGAAC ACCAATCCGC TGGCCCGGTA CACGCTGATG GCCCAGGGCG AGCAGGTCCA CATCTCGACC TACCCGCCGG CTTGGCCGAC GCGCCCGCCG GGAGAGAGCG CCGCCTACGA CCTGAAACGG GCCATCGAGA TCCGTGCCGG GGCGCATGCT TTCGAGGCCA AGGTGTTCAA CATCGTCTGC TCCGCCGTCC TCGACGCGGC CGCCAGGGCC ACCCTCTGCG ACGGGGACGC TGCCCTCGCC GAACTCGTCG AGCGGACCCC CGCGGGCGTG TCCATGGTCC TCGACCCCAC CGGCTCCCAT GTCGTCGAGC CGCACCAGGG GGACGAGACG ATCGTCTACG CCGACATCGA CGTCGAGGCC TGCGTCGAGC CCAAGCAGTT CCACGACGTC GTCGGCTACT ACAACCGCTT CGACATCTTC CGCCTCCATG TCGACCGCAC GCCGCGCGAG CCGATCAGCT TCGACGCGGC CGCCCGGCCG TCGGGCGTTG CCGCCGACGG CGTCGATGGG CTCGAGGCCC TCGATCCGGA TGGCGCCCGC GCACAGCCCG GCCTTGGCGA GGCGGCGCCG GCCCCGCCGC TTCGCCGCGC AGGCCACTGA
|
Protein sequence | MSVQYPKFKA AACHVASVFL DSTASAEKAV ALIGEAARAG ADLVVFPEGY MPGFPLWAAL RAPIHNHDLF KRLAAQSVRL DGPEIGAVRA AARRHGVLVS LGFSESTEAS VGCLWNANVL IGRDGAILNH HRKLVPTFYE KLIWANGDAR GLRVTRTEIG RVGMLICGEN TNPLARYTLM AQGEQVHIST YPPAWPTRPP GESAAYDLKR AIEIRAGAHA FEAKVFNIVC SAVLDAAARA TLCDGDAALA ELVERTPAGV SMVLDPTGSH VVEPHQGDET IVYADIDVEA CVEPKQFHDV VGYYNRFDIF RLHVDRTPRE PISFDAAARP SGVAADGVDG LEALDPDGAR AQPGLGEAAP APPLRRAGH
|
| |