Gene Namu_5276 details

Gene Information

Locus tag: Namu_5276
Symbol:
ID: 8450908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5890907
End bp: 5894194
Gene Length: 3288 bp
Protein Length: 1095 aa
Translation table: 11
GC content: 70%
IMG OID: 645044308
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_003204531
Protein GI: 258655375
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type
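
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Namu_5276.
length = gene_length_bp(5890907, 5894194)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the Gene Length and Protein Length fields.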


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCGAC TGAGCGTCCT GAACTGGCCG GTGGTCCGGC AGTTCCGGTC GGGCGACGTG 
TTCGGCCGTG GGCCGGCAGT CACTTCGGCG CGCACCCGGG AGCTGACTCC CCGCACCTCC
ACCGCCGACC GGGTCGCCCG CAGCGTGTGC CCCTACTGCG CCGTCGGCTG CGGGCAGAAG
GTGTTCGTCA AGGACGAAAA AGTCGTCCAG ATCGAGGGCG ATCCGGACTC GCCGATCTCC
CGGGGCCGGT TGTGCCCCAA GGGGTCGGCC AGTGAGCAGC TGGTCAACTC ACCCGCCCGG
CAGACCCGGA TCCTGTACCG CCCGCCGCAC GCCACGGACT GGCAACCGCT GGAGCTGGAC
ACGGCGATCG ACATGATCGC CGACCGGTTC CTGGCCGCCC GCCGGGCCCA CTGGCAGGAC
GCCGACGACG AGGGTCGACC GCTGCGCCGG ACGATGGGCA TCGCCGCCCT CGGCGGGGCC
ACGCTGGACA ACGAAGAGAA CTACCTGATC AAGAAGCTGT TCACCGCGGC CGGCGCGATC
CAGGTCGAGA ACCAGGCCCG TATTTGACAC TCCGCCACGG TTCCCAGTTT GGGAGCCTCG
TTCGGACGCG GCGGCGCGAC GCAATCGTTG CAGGACATGG CCAATGCCGA CTGCATCGTG
ATCCAGGGCT CCAACATGGC CGAGTGCCAC CCGGTGGGCT TCCAGTGGGT GTCCGAGGCC
AAGGCGCGCG GCGCCCGGAT CATCCACGTC GACCCCCGCT TCACCCGAAC GAGCGCGATC
GCGCACAAGC ACATCCCGAT CCGGGCCGGA TCCGACGTGG TGCTGCTGGG TGCGTTGATC
AAGCACGTGC TGGACAACGA CCTGTGGTTC CACGAGTACG TCGTGCACTA CACGAACGCG
GCGACCATCG TGAACGAGGA CTTCCGCGAC GCCGAGGACC TGGGCGGGCT GTTCTCCGGG
TTCGACCCGG CGACCGGGAA GTACGACCTG TCGTCCTGGG CCTACGAGAC CGCTGATGAT
GCGCACGAGG ACGCCGCCGA GCACGGCGCC TCCGCGTCCG ACCGGGCGGC CGGTGACGAG
CACGGCAGCG GTGGGCCGAC CCTGCCGCAC GCCAAGGTGC TGCGCGACGA CACCCTGCAG
CACCCGCGGT GCGTGTTCCA GATCCTTCGG CGGCACTACG CCCGCTACAC CCCGGCGATG
GTGCAGGAGG CCTGCGGGAT CAACCCGGCC GACTTCCGGT ACCTGGCCGA CTCGATCACT
CAGAATTCCG GGCGCGACCG GACCACCTGC TTCGCCTACG CGGTCGGCTG GACCCAGCAC
ACGCTGGGTG CCCAGTTCAT CCGCACGTCG GCGATCCTGC AGCTGTTGCT GGGCAACATG
GGTCGCCCGG GCGGCGGGAT CATGGCCCTG CGCGGGCACG CCAGCATCCA GGGCTCGACC
GACATCCCCA CCCTGTTCAA CCTGCTGCCC GGGTACCTGC CGATGCCGAC GGTCGGCCGG
CACGACACCT ACCAGGAGTA CCTGGACGCC ATCTCGTCCA AGCAGCAAAA GGGGTTCTGG
GCGGCCGCCG ACACCTACGT GGTCAGCCTG CTCAAGGCCT GGTGGGGGAA CGCGGCGACG
GCCGAGAACG ACTGGGCCTA CGACTATCTG CCCCGGCTGT CCGGGCCGCA CGGCACCTAC
CAGACCGTGC ACGACATGCT CGAGGACAAG GTCGACGGCT ACTTCATCCT CGGCCAGAAC
CCGGCCGTCG GGTCGGCCAA CGGCCGGATG CAGCGCCTCG GCATGGCCCA CCTCAAGTGG
CTGGTGGTGC GCGATCTGAA CCTGATCGAG TCGGCCACCT GGTGGAAGGA CGGCCCGGAG
ATCGCCTCCG GTGAGCTGAC CACCGAGGGC ATCGGCACCG AGGTGTTCTT CCTGCCGGCG
GCCACCCACG TGGAGAAGGC GGGCTCGTTC ACCCAGACCC AGCGGCTGGT GCAGTGGCGG
GAGAAGGCGG TCGACCCGCC CGGGCAGGCG CAGGCCGAGC TGGAGTTCTT CTACGAGCTG
GGCCAGCGGA TCCGGGCCAA GCTGGCCGGC TCGACCGACC CGCGGGACCG CCCGCTGCTC
GACCTGACCT GGGACTACCC GCTGGACGAG CACGGCGAGA TCGACCCGGA ATCGGTGCTG
CGGGAGATCA ACGGCTACCG GTTGACCGGC CCCGACGCCG GGGTGCCGAT CTCGTCCTAC
CTGCAGCTGC GCGCGGACGG CAGCACGGTG GGCGGGTGCT GGATCTACGC CGGGGTGTAC
GCCGACGGCG TCAACCACGC GGCCGACCGG GTGCCCCACG GCGGGCCGAG CCCGAGCCAG
AACGAGTGGG GCTGGGCGTG GCCGGCGAAC CGGCGGATCC TGTACAACCG GGCCTCGGCG
GACCCCGCGG GCCGGCCGTG GAGCGAACGC AAGAAGCTGG TCTGGTGGGA CGAGCAGCAG
CAGCGCTGGG TCGGCAATGA CGTGCCCGAC TTCGTCGTCG ACCGGGCCCC GGGCAGCCGT
CCCGACCCGG AACTGGGCGG CGCCGCCGCG TTGGCCGGCG ACGACCCGTT CGTCATGCAG
GCGGACGGCA AGGGCTGGCT GTTCGCGCCC AAGGGCATGC TCGACGGCCC GTTGCCCACG
CACTACGAAC CACAGGAGTC CCCGGTCCGC AACGCGCTGT ACCCGCAGCA GCAGAGCCCG
TCGCGGATCC TGATGCCCGG CAAGGACAAC CTGTGGGCGC CCAGCGCCGG TCAGCCCGGG
GCCGGGGTCT ACCCGTACGT GTTCACCACC TACCGGCTGA CCGAACACCA CACCGCGGGC
GGGATGAGCC GCTGGCTGCC GTTCCTGGCC GAGCTGCAGC CGGAGATGTT CTGCGAGGTC
TCCCCGGAGC TGGCCGCCGG GAAGGGCCTG GAGAACTTCG GCTGGGCGAC CATCATCTCG
CCCCGGTCGG CGATCGAGGC CAAGGTGCTG GTCACCGACC GGATGACGCC GCTGACGATC
GGCGGGCACA CCATCCACCA GATCGGGCTG CCCTACCACT GGGGCGTCGG CGGCGACGCC
GTGGTCAGCG GGGACGCGGC CAACGACCTG CTCGGCATCA CCCTGGATCC GAATGTGCAG
ATCCAGGAGT CCAAGGCCGG GTCGTGCGAC ATCCGGCCCG GCCGCCGGCC GCAGGGCGAG
GACCTGCTGC GACTGGTCGC CGAGTACCAG TCCCGGTCCG GCGCTACCGT CGAGACCGAC
AACGAGCGGG TCACCGACCC CGACCGTGAG GGCGAGGGGC GGCGCTGA
 
Protein sequence
MGRLSVLNWP VVRQFRSGDV FGRGPAVTSA RTRELTPRTS TADRVARSVC PYCAVGCGQK 
VFVKDEKVVQ IEGDPDSPIS RGRLCPKGSA SEQLVNSPAR QTRILYRPPH ATDWQPLELD
TAIDMIADRF LAARRAHWQD ADDEGRPLRR TMGIAALGGA TLDNEENYLI KKLFTAAGAI
QVENQARIUH SATVPSLGAS FGRGGATQSL QDMANADCIV IQGSNMAECH PVGFQWVSEA
KARGARIIHV DPRFTRTSAI AHKHIPIRAG SDVVLLGALI KHVLDNDLWF HEYVVHYTNA
ATIVNEDFRD AEDLGGLFSG FDPATGKYDL SSWAYETADD AHEDAAEHGA SASDRAAGDE
HGSGGPTLPH AKVLRDDTLQ HPRCVFQILR RHYARYTPAM VQEACGINPA DFRYLADSIT
QNSGRDRTTC FAYAVGWTQH TLGAQFIRTS AILQLLLGNM GRPGGGIMAL RGHASIQGST
DIPTLFNLLP GYLPMPTVGR HDTYQEYLDA ISSKQQKGFW AAADTYVVSL LKAWWGNAAT
AENDWAYDYL PRLSGPHGTY QTVHDMLEDK VDGYFILGQN PAVGSANGRM QRLGMAHLKW
LVVRDLNLIE SATWWKDGPE IASGELTTEG IGTEVFFLPA ATHVEKAGSF TQTQRLVQWR
EKAVDPPGQA QAELEFFYEL GQRIRAKLAG STDPRDRPLL DLTWDYPLDE HGEIDPESVL
REINGYRLTG PDAGVPISSY LQLRADGSTV GGCWIYAGVY ADGVNHAADR VPHGGPSPSQ
NEWGWAWPAN RRILYNRASA DPAGRPWSER KKLVWWDEQQ QRWVGNDVPD FVVDRAPGSR
PDPELGGAAA LAGDDPFVMQ ADGKGWLFAP KGMLDGPLPT HYEPQESPVR NALYPQQQSP
SRILMPGKDN LWAPSAGQPG AGVYPYVFTT YRLTEHHTAG GMSRWLPFLA ELQPEMFCEV
SPELAAGKGL ENFGWATIIS PRSAIEAKVL VTDRMTPLTI GGHTIHQIGL PYHWGVGGDA
VVSGDAANDL LGITLDPNVQ IQESKAGSCD IRPGRRPQGE DLLRLVAEYQ SRSGATVETD
NERVTDPDRE GEGRR
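
The protein sequence contains a U (selenocysteine) where the gene sequence has an in-frame TGA codon, which matches the COG annotation "typically selenocysteine-containing". The sketch below shows how the two sequences relate. It builds the codon table by hand (translation table 11 assigns the same amino acids as the standard code) and assumes, as a simplification, that every internal TGA is read as selenocysteine and that only a final stop codon terminates translation. The `translate` function and its `internal_tga_as_sec` parameter are hypothetical names for illustration.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, internal_tga_as_sec: bool = True) -> str:
    """Translate an in-frame CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = []
    for index, codon in enumerate(codons):
        aa = CODON_TABLE[codon]
        if aa == "*":
            if index == len(codons) - 1:
                break  # terminal stop codon: end of protein
            if codon == "TGA" and internal_tga_as_sec:
                aa = "U"  # assumption: internal TGA is recoded as selenocysteine
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence above.
print(translate("ATGGGGCGAC TGAGCGTCCT GAACTGGCCG GTGGTCCGGC AGTTCCGGTC GGGCGACGTG"))
# In-frame fragment around the TGA codon (gene positions 541 to 579).
print(translate("CAGGTCGAGA ACCAGGCCCG TATTTGACAC TCCGCCACG"))
```

The first call reproduces the opening 20 residues of the protein sequence, and the second reproduces the stretch containing U.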