Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3375 |
Symbol | |
ID | 5834693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3740387 |
End bp | 3741616 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641369174 |
Product | formamidase |
Protein accession | YP_001640832 |
Protein GI | 163852789 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGA CCCTGATCAA GGTCGATCTC ACGCAGTCGG CCTACGACAA CGACATGGTG CACAACCGCT GGCACCCCGA CATTCCGATG GTCGCGACCG TGAAGCCCGG CGACGACTTC ATCGTCGAGA CCTATGACTG GACCGGCGGC TTCATCAAGA ACAACGATTC CGCCGACGAC GTGCGCGACA TCGACCTGTC GATCGTGCAC TTCCTGTCGG GCCCGATCGG CGTCGAGGGT GCCGAGCCCG GCGACCTGCT CGTGGTCGAT CTGCTCGATA TCGGCGCCAA GCCCGAGAGC CAGTGGGGCT TCAACGGCTT CTTCTCGAAG AACAACGGCG GCGGCTTCCT CGACGAGCAC TTCCCCCAGG CCCAGAAGTC GATCTGGGAT TTCGAGGGCA TGTTCACCAA GTCGCGCCAC GTGCCTGGCG TGCGCTTCCC CGGCCTGATC CATCCCGGCC TGATCGGCTG CTTGCCCGAT CCGAAGATGC TGGAGACCTG GAACACCCGC GAGAAGGCGC TCTACGACAC CAACCCGAGC CGGGTGCCGG CGCTGGCCAC CCTCCCCTTC GGGCCGACCG CGCATATGGG CCGGCTGAAG GGTGACGCGA AGGACAATGC CGCGGCCACG GGCGCCCGTA CGGTGCCGGG GCGCGAGCAT GGCGGCAATT GCGACATCAA GGATCTGTCG CGCGGCTCGA AGATCTACTT CCCCGTCTAC GTGCCCGGCG CCGGCCTCTC GATGGGCGAC CTGCATTTCA GCCAGGGTGA CGGCGAGATC ACCTTCTGCG GCGCTATCGA GATGGCGGGC TGGGTTCACC TCAAGGTGAG CCTGATCAAG GACGGCATGG CGAAGTACGG GATCAAGAAC CCGATCTTCA AGCCGTCGCC GATCACCCCG AAATACGACG ACCACCTGAT CTTCGAGGGC GTCTCGGTCG ACGAGTACGG CAAGCAGCAT TACCTCGACG TCACGGTCGC CTACCGCCAA GCCTGCCTCA ACGCGATCGA GTACCTGAAG AAGTTCGGCT ACTCGGGCGC CCAGGCCTAC TCGATCCTCG GCACCGCCCC GGTCCAGGGC CACATCTCCG GCGTCGTCGA TATCCCGAAC GCCTGCGCCA CGCTCTGGAT TCCGACCGGC ATCTTCGACT TCGACATCAA CCCGTCCGAG GCCGGGCCGA CGAAGTTCCT CGACGGCTCG ATCCAGATGC CGCTCTCGCC GGATCTCTGA
|
Protein sequence | MPETLIKVDL TQSAYDNDMV HNRWHPDIPM VATVKPGDDF IVETYDWTGG FIKNNDSADD VRDIDLSIVH FLSGPIGVEG AEPGDLLVVD LLDIGAKPES QWGFNGFFSK NNGGGFLDEH FPQAQKSIWD FEGMFTKSRH VPGVRFPGLI HPGLIGCLPD PKMLETWNTR EKALYDTNPS RVPALATLPF GPTAHMGRLK GDAKDNAAAT GARTVPGREH GGNCDIKDLS RGSKIYFPVY VPGAGLSMGD LHFSQGDGEI TFCGAIEMAG WVHLKVSLIK DGMAKYGIKN PIFKPSPITP KYDDHLIFEG VSVDEYGKQH YLDVTVAYRQ ACLNAIEYLK KFGYSGAQAY SILGTAPVQG HISGVVDIPN ACATLWIPTG IFDFDINPSE AGPTKFLDGS IQMPLSPDL
|
| |