Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3266 |
Symbol | |
ID | 6132129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3616867 |
End bp | 3619833 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641643453 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001770105 |
Protein GI | 170741450 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.271284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.20027 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCG GCACCCAGAA CAGCAACCCG GCCGGCGGCG ACCCGCTCGC GACCCGGCCC TCCGCGCCCC AGGATGCCGG CGGCACCGGC GGCACGCCCC AGGGCGGCGG CTCCTACTCG CAGGGCACGA AGGCCGGCGG GCAGGCCGGG CCCGCCCCGA GCGGGACCCT GTCGCTGCGC GACCTGCCGG CGGGCATCCC CGGCGTCGTG TTCGAGCTCG ACGGCGAGCA GGTCGAGGCG CGCCCGGGCG AGACCCTGTG GGCGGTCGCC CAGCGGCTCG GCACCCACAT CCCGCACCTC TGCCACAAGC CCGCCCCGGG CTACCGGCCG GACGGCAATT GCCGCGCCTG CATGGTCGAG ATCGAGGGGG AGCGCGTGCT CGCGGCCTCG TGCAAGCGCA CGCCGGCCGT CGGCATGAAG GTCAAGACCC AGACCGACCG GGCCGTGAAG GCCCGCGCCA TGGTGATGGA GCTCCTCGTC GCCGATCAGC CGGAGCGCGC GACCTCGCAC GATCCGGACT CGCACTTCTG GTCGCAGGCC GACCACGTCG GCGTGAGCGA GAGCCGCTTC CCGGCCGAGG AGCGCTGGGC GACCGACGCG AGCCACCCGG CGATGCGGGT GAACCTCGAC GCCTGCATCC AGTGCAACCT CTGCGTCCGG GCCTGCCGCG AAGTCCAGGT CAACGACGTG ATCGGCATGG CCTACCGCAA CGCGGACGCC AAGGTCGTGT TCGACTTCGA CGACCCGATG GGCTCCTCGA CCTGCGTCGC CTGCGGCGAG TGCGTCCAGG CCTGCCCGAC CGGCGCCCTG ATGCCGGCCG CCTACCTGAA CGATGCGCAG CAGCGCGAGG TCTATCCCGA CCGCTCGGTC GACTCGCTCT GCCCCTATTG CGGCGTCGGC TGCCAGGTCT CCTACAAGGT CAAGGACGAC CGCATCGTCT ACGCGGAGGG CAAGGACGGC CCGGCCAACC ACAACCGGCT CTGCGTGAAG GGCCGCTTCG GCTTCGACTA CGTCCACCAC CCCCACCGGC TGACCAAGCC CCTGATTCGC CTCGACAACG TCGCCAAGGA CGCCAACGAC CAGGTCGATC CGGCCAATCC CTGGACCCAT TTCCGCGAGG CGAGCTGGGA GGAGGCCCTC GCGCGCGCGG CCGGCGGCCT GAGGCGCGTG CGCGACAGCC ACGGACGCCA CGCGCTGGCG GGCTTCGGCT CGGCCAAGGG CTCGAACGAG GAAGCCTACC TGTTCCAGAA GCTGGTGCGG CTCGGCTTCG GCACCAACAA CGTCGACCAT TGCACCCGGC TCTGCCACGC CTCCTCGGTG GCCGCGCTGA TGGAGGGGCT GAATTCGGGC GCCGTCTCGG CCCCGTTCTC GGCGGCGCTC GACGCCGAGG TGATCGTCGT CATCGGCGCG AACCCGACCG TGAACCACCC GGTCGCCGCC ACCTTCATCA AGAACGCCGT CAAGGAGCGC GGCGCCAAGC TGATCATCAT GGACCCGCGC CGGCAGGTCC TGTCGCGCCA CGCCTATCGC CACCTCGCCT TCAAGCCGGG CACCGACGTG GCCATGCTGA ACGCGATGCT GAACGTGATC GTCGAGGAGG GGCTCACCGA TCAGCAGTAC ATCGCCGGCT ACACCGAGAA CTTCGACGCG CTGAAGGACC GCATCCGCGA GTTCACGCCC GAGAAGATGG CGAAGGTGTG CGGCATCCCG GCCGAGACCC TGCGCGAGGT GGCCCGGCTC TACGCCCGCT CGAAGGCCTC GATCATCTTC TGGGGCATGG GCGTGAGCCA GCACGTGCAC GGCACGGACA ATTCCCGCTG CCTGATCGCG CTCGCCCTCA CCACCGGCCA GATCGGCCGG CCCGGCACCG GCCTGCACCC CCTGCGCGGC CAGAACAACG TCCAGGGCGC CTCCGATGCC GGCCTGATCC CGATGGTCTA TCCGGACTAC CAATCCGTCG AGAAGGCGGC GGTGCGGGAA TTGTTCGAGG ACTTCTGGGG CCAGTCCCTC GATCCCAAGC AGGGTCTCAC CGTGGTGGAG ATCATGCGGG CGATCCACGC CGGCAGCATC CGCGGCATGT ACGTCGAGGG TGAGAACCCG GCGATGTCGG ATCCCGACCT CAACCACGCC CGGCAGGCGC TGGCGATGCT CGACCACCTC GTCGTGCAGG ACCTCTTCCT GACCGAGACC GCCTTCCACG CCGATGTGGT GCTGCCGGCC TCGGCCTTCG CCGAGAAGGC CGGCACCTTC ACCAATACCG ATCGCCGCGT GCAGATGGCC CGCCCGGTCG TGCCGCCCCC GGGCGACGCG CGCCAGGATT GGTGGATCAT CCAGGAGATG GCCCGGCGCC TCGGCCTGGC CTGGGATTAT GGCGGCCCGG CCGACATCTT CACCGAGATG GCCCGGGTGA TGCCCTCGCT CAAGAACATC ACCTGGGAGC GGGTGGAGCG CGAGGGCGCC GTCACCTACC CGGTCGACGC GCCGGACGAG CCCGGCCACG AGATCATCTT CTACGCCGGG TTCCCGACCG AGAGCGGGCG CGCCAAGATC GTTCCGGCCG CGGTGACGCC GCCCGACGAG GTGCCGGACA CCGAGTTCCC GATGGTCCTC TCGACCGGCC GCGTGCTGGA GCACTGGCAC ACGGGCTCGA TGACCCGCCG CGCCGGGGTG CTCGACGCGA TCGAGCCCGA GGCGGTGGCC TTCATGGCGC CGCGCGAACT CGGCCGGCTC GGCCTCGTGC CGGGCGACCG GATGCGCCTC GAGACCCGCC GCGGCGCCGT CGAGGTGAAG GTGCGCTCCG ACCGCGACGT GCCGGAGGGG ATGGTGTTCA TGCCCTTCTG CTACGCCGAG GCCGCCGCGA ACCTGCTCAC CAACCCGGCG CTCGACCCGA TGGGCAAGAT CCCGGAGTTC AAGTTCTGCG CCGCCCGGGT GGTGCCCGTC GTTCCGGCCT CGATCGCGGC CGAGTAG
|
Protein sequence | MSSGTQNSNP AGGDPLATRP SAPQDAGGTG GTPQGGGSYS QGTKAGGQAG PAPSGTLSLR DLPAGIPGVV FELDGEQVEA RPGETLWAVA QRLGTHIPHL CHKPAPGYRP DGNCRACMVE IEGERVLAAS CKRTPAVGMK VKTQTDRAVK ARAMVMELLV ADQPERATSH DPDSHFWSQA DHVGVSESRF PAEERWATDA SHPAMRVNLD ACIQCNLCVR ACREVQVNDV IGMAYRNADA KVVFDFDDPM GSSTCVACGE CVQACPTGAL MPAAYLNDAQ QREVYPDRSV DSLCPYCGVG CQVSYKVKDD RIVYAEGKDG PANHNRLCVK GRFGFDYVHH PHRLTKPLIR LDNVAKDAND QVDPANPWTH FREASWEEAL ARAAGGLRRV RDSHGRHALA GFGSAKGSNE EAYLFQKLVR LGFGTNNVDH CTRLCHASSV AALMEGLNSG AVSAPFSAAL DAEVIVVIGA NPTVNHPVAA TFIKNAVKER GAKLIIMDPR RQVLSRHAYR HLAFKPGTDV AMLNAMLNVI VEEGLTDQQY IAGYTENFDA LKDRIREFTP EKMAKVCGIP AETLREVARL YARSKASIIF WGMGVSQHVH GTDNSRCLIA LALTTGQIGR PGTGLHPLRG QNNVQGASDA GLIPMVYPDY QSVEKAAVRE LFEDFWGQSL DPKQGLTVVE IMRAIHAGSI RGMYVEGENP AMSDPDLNHA RQALAMLDHL VVQDLFLTET AFHADVVLPA SAFAEKAGTF TNTDRRVQMA RPVVPPPGDA RQDWWIIQEM ARRLGLAWDY GGPADIFTEM ARVMPSLKNI TWERVEREGA VTYPVDAPDE PGHEIIFYAG FPTESGRAKI VPAAVTPPDE VPDTEFPMVL STGRVLEHWH TGSMTRRAGV LDAIEPEAVA FMAPRELGRL GLVPGDRMRL ETRRGAVEVK VRSDRDVPEG MVFMPFCYAE AAANLLTNPA LDPMGKIPEF KFCAARVVPV VPASIAAE
|
| |