Gene Information |
Locus tag | Namu_5276 |
Symbol | |
ID | 8450908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 5890907 |
End bp | 5894194 |
Gene Length | 3288 bp |
Protein Length | 1095 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645044308 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003204531 |
Protein GI | 258655375 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
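The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (both assumptions agree with the values in this record).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(5890907, 5894194)
print(length_bp)                  # 3288, matching "Gene Length"
print(protein_length(length_bp))  # 1095, matching "Protein Length"
```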
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGGCGAC TGAGCGTCCT GAACTGGCCG GTGGTCCGGC AGTTCCGGTC GGGCGACGTG TTCGGCCGTG GGCCGGCAGT CACTTCGGCG CGCACCCGGG AGCTGACTCC CCGCACCTCC ACCGCCGACC GGGTCGCCCG CAGCGTGTGC CCCTACTGCG CCGTCGGCTG CGGGCAGAAG GTGTTCGTCA AGGACGAAAA AGTCGTCCAG ATCGAGGGCG ATCCGGACTC GCCGATCTCC CGGGGCCGGT TGTGCCCCAA GGGGTCGGCC AGTGAGCAGC TGGTCAACTC ACCCGCCCGG CAGACCCGGA TCCTGTACCG CCCGCCGCAC GCCACGGACT GGCAACCGCT GGAGCTGGAC ACGGCGATCG ACATGATCGC CGACCGGTTC CTGGCCGCCC GCCGGGCCCA CTGGCAGGAC GCCGACGACG AGGGTCGACC GCTGCGCCGG ACGATGGGCA TCGCCGCCCT CGGCGGGGCC ACGCTGGACA ACGAAGAGAA CTACCTGATC AAGAAGCTGT TCACCGCGGC CGGCGCGATC CAGGTCGAGA ACCAGGCCCG TATTTGACAC TCCGCCACGG TTCCCAGTTT GGGAGCCTCG TTCGGACGCG GCGGCGCGAC GCAATCGTTG CAGGACATGG CCAATGCCGA CTGCATCGTG ATCCAGGGCT CCAACATGGC CGAGTGCCAC CCGGTGGGCT TCCAGTGGGT GTCCGAGGCC AAGGCGCGCG GCGCCCGGAT CATCCACGTC GACCCCCGCT TCACCCGAAC GAGCGCGATC GCGCACAAGC ACATCCCGAT CCGGGCCGGA TCCGACGTGG TGCTGCTGGG TGCGTTGATC AAGCACGTGC TGGACAACGA CCTGTGGTTC CACGAGTACG TCGTGCACTA CACGAACGCG GCGACCATCG TGAACGAGGA CTTCCGCGAC GCCGAGGACC TGGGCGGGCT GTTCTCCGGG TTCGACCCGG CGACCGGGAA GTACGACCTG TCGTCCTGGG CCTACGAGAC CGCTGATGAT GCGCACGAGG ACGCCGCCGA GCACGGCGCC TCCGCGTCCG ACCGGGCGGC CGGTGACGAG CACGGCAGCG GTGGGCCGAC CCTGCCGCAC GCCAAGGTGC TGCGCGACGA CACCCTGCAG CACCCGCGGT GCGTGTTCCA GATCCTTCGG CGGCACTACG CCCGCTACAC CCCGGCGATG GTGCAGGAGG CCTGCGGGAT CAACCCGGCC GACTTCCGGT ACCTGGCCGA CTCGATCACT CAGAATTCCG GGCGCGACCG GACCACCTGC TTCGCCTACG CGGTCGGCTG GACCCAGCAC ACGCTGGGTG CCCAGTTCAT CCGCACGTCG GCGATCCTGC AGCTGTTGCT GGGCAACATG GGTCGCCCGG GCGGCGGGAT CATGGCCCTG CGCGGGCACG CCAGCATCCA GGGCTCGACC GACATCCCCA CCCTGTTCAA CCTGCTGCCC GGGTACCTGC CGATGCCGAC GGTCGGCCGG CACGACACCT ACCAGGAGTA CCTGGACGCC ATCTCGTCCA AGCAGCAAAA GGGGTTCTGG GCGGCCGCCG ACACCTACGT GGTCAGCCTG CTCAAGGCCT GGTGGGGGAA CGCGGCGACG GCCGAGAACG ACTGGGCCTA CGACTATCTG CCCCGGCTGT CCGGGCCGCA CGGCACCTAC CAGACCGTGC ACGACATGCT CGAGGACAAG GTCGACGGCT ACTTCATCCT CGGCCAGAAC CCGGCCGTCG GGTCGGCCAA CGGCCGGATG CAGCGCCTCG GCATGGCCCA CCTCAAGTGG 
CTGGTGGTGC GCGATCTGAA CCTGATCGAG TCGGCCACCT GGTGGAAGGA CGGCCCGGAG ATCGCCTCCG GTGAGCTGAC CACCGAGGGC ATCGGCACCG AGGTGTTCTT CCTGCCGGCG GCCACCCACG TGGAGAAGGC GGGCTCGTTC ACCCAGACCC AGCGGCTGGT GCAGTGGCGG GAGAAGGCGG TCGACCCGCC CGGGCAGGCG CAGGCCGAGC TGGAGTTCTT CTACGAGCTG GGCCAGCGGA TCCGGGCCAA GCTGGCCGGC TCGACCGACC CGCGGGACCG CCCGCTGCTC GACCTGACCT GGGACTACCC GCTGGACGAG CACGGCGAGA TCGACCCGGA ATCGGTGCTG CGGGAGATCA ACGGCTACCG GTTGACCGGC CCCGACGCCG GGGTGCCGAT CTCGTCCTAC CTGCAGCTGC GCGCGGACGG CAGCACGGTG GGCGGGTGCT GGATCTACGC CGGGGTGTAC GCCGACGGCG TCAACCACGC GGCCGACCGG GTGCCCCACG GCGGGCCGAG CCCGAGCCAG AACGAGTGGG GCTGGGCGTG GCCGGCGAAC CGGCGGATCC TGTACAACCG GGCCTCGGCG GACCCCGCGG GCCGGCCGTG GAGCGAACGC AAGAAGCTGG TCTGGTGGGA CGAGCAGCAG CAGCGCTGGG TCGGCAATGA CGTGCCCGAC TTCGTCGTCG ACCGGGCCCC GGGCAGCCGT CCCGACCCGG AACTGGGCGG CGCCGCCGCG TTGGCCGGCG ACGACCCGTT CGTCATGCAG GCGGACGGCA AGGGCTGGCT GTTCGCGCCC AAGGGCATGC TCGACGGCCC GTTGCCCACG CACTACGAAC CACAGGAGTC CCCGGTCCGC AACGCGCTGT ACCCGCAGCA GCAGAGCCCG TCGCGGATCC TGATGCCCGG CAAGGACAAC CTGTGGGCGC CCAGCGCCGG TCAGCCCGGG GCCGGGGTCT ACCCGTACGT GTTCACCACC TACCGGCTGA CCGAACACCA CACCGCGGGC GGGATGAGCC GCTGGCTGCC GTTCCTGGCC GAGCTGCAGC CGGAGATGTT CTGCGAGGTC TCCCCGGAGC TGGCCGCCGG GAAGGGCCTG GAGAACTTCG GCTGGGCGAC CATCATCTCG CCCCGGTCGG CGATCGAGGC CAAGGTGCTG GTCACCGACC GGATGACGCC GCTGACGATC GGCGGGCACA CCATCCACCA GATCGGGCTG CCCTACCACT GGGGCGTCGG CGGCGACGCC GTGGTCAGCG GGGACGCGGC CAACGACCTG CTCGGCATCA CCCTGGATCC GAATGTGCAG ATCCAGGAGT CCAAGGCCGG GTCGTGCGAC ATCCGGCCCG GCCGCCGGCC GCAGGGCGAG GACCTGCTGC GACTGGTCGC CGAGTACCAG TCCCGGTCCG GCGCTACCGT CGAGACCGAC AACGAGCGGG TCACCGACCC CGACCGTGAG GGCGAGGGGC GGCGCTGA
Protein sequence | MGRLSVLNWP VVRQFRSGDV FGRGPAVTSA RTRELTPRTS TADRVARSVC PYCAVGCGQK VFVKDEKVVQ IEGDPDSPIS RGRLCPKGSA SEQLVNSPAR QTRILYRPPH ATDWQPLELD TAIDMIADRF LAARRAHWQD ADDEGRPLRR TMGIAALGGA TLDNEENYLI KKLFTAAGAI QVENQARIUH SATVPSLGAS FGRGGATQSL QDMANADCIV IQGSNMAECH PVGFQWVSEA KARGARIIHV DPRFTRTSAI AHKHIPIRAG SDVVLLGALI KHVLDNDLWF HEYVVHYTNA ATIVNEDFRD AEDLGGLFSG FDPATGKYDL SSWAYETADD AHEDAAEHGA SASDRAAGDE HGSGGPTLPH AKVLRDDTLQ HPRCVFQILR RHYARYTPAM VQEACGINPA DFRYLADSIT QNSGRDRTTC FAYAVGWTQH TLGAQFIRTS AILQLLLGNM GRPGGGIMAL RGHASIQGST DIPTLFNLLP GYLPMPTVGR HDTYQEYLDA ISSKQQKGFW AAADTYVVSL LKAWWGNAAT AENDWAYDYL PRLSGPHGTY QTVHDMLEDK VDGYFILGQN PAVGSANGRM QRLGMAHLKW LVVRDLNLIE SATWWKDGPE IASGELTTEG IGTEVFFLPA ATHVEKAGSF TQTQRLVQWR EKAVDPPGQA QAELEFFYEL GQRIRAKLAG STDPRDRPLL DLTWDYPLDE HGEIDPESVL REINGYRLTG PDAGVPISSY LQLRADGSTV GGCWIYAGVY ADGVNHAADR VPHGGPSPSQ NEWGWAWPAN RRILYNRASA DPAGRPWSER KKLVWWDEQQ QRWVGNDVPD FVVDRAPGSR PDPELGGAAA LAGDDPFVMQ ADGKGWLFAP KGMLDGPLPT HYEPQESPVR NALYPQQQSP SRILMPGKDN LWAPSAGQPG AGVYPYVFTT YRLTEHHTAG GMSRWLPFLA ELQPEMFCEV SPELAAGKGL ENFGWATIIS PRSAIEAKVL VTDRMTPLTI GGHTIHQIGL PYHWGVGGDA VVSGDAANDL LGITLDPNVQ IQESKAGSCD IRPGRRPQGE DLLRLVAEYQ SRSGATVETD NERVTDPDRE GEGRR
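The protein sequence contains a U (selenocysteine) at residue 189, where the gene sequence has an in-frame TGA codon at nucleotides 565-567. Under translation table 11 that codon normally reads as a stop. The following sketch shows one way to reproduce this; it is an illustration, not the annotation pipeline's actual method. The function names and the `recode_tga_at` parameter are hypothetical, and the sequence fragments are copied from the record above.

```python
from itertools import product

# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str, recode_tga_at=()) -> str:
    """Translate in frame 1; TGA codons at the given 1-based codon
    numbers are emitted as 'U' (selenocysteine) instead of '*'."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        codon_number = i // 3 + 1
        if codon == "TGA" and codon_number in recode_tga_at:
            residues.append("U")
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


# First 30 nucleotides of the gene sequence.
print(translate("ATGGGGCGAC TGAGCGTCCT GAACTGGCCG"))  # MGRLSVLNWP

# Nucleotides 553-576 (codons 185-192); TGA is the 5th codon of the fragment.
fragment = "CAGGCCCGTATTTGACACTCCGCC"
print(translate(fragment))                     # QARI*HSA
print(translate(fragment, recode_tga_at={5}))  # QARIUHSA
```

To translate the full gene this way, the recoded position would be codon 189 rather than 5, and the trailing stop codon would need to be trimmed from the result.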