Gene DET0147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDET0147 
Symbol 
ID3230473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides ethenogenes 195 
KingdomBacteria 
Replicon accessionNC_002936 
Strand
Start bp143111 
End bp144832 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content53% 
IMG OID637119715 
Product[Fe] hydrogenase, large subunit HymC, putative 
Protein accessionYP_180897 
Protein GI57235003 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTACGC TAAATATTGA CAACAAGCAG ATAAGCGTAC CTGAAGGTAC GACTATTATG 
CAGGCCGCCA AGCAGGCCAA TATCAATATC CCCCACCTGT GTTATTTTGA AGGCCTGAAA
AGTTACAGCG GCTGCCGGGT ATGTGTGGTG GAAATAGAGG GTGAACCCCG TCTGGCTACG
TCCTGCTCAC GCAAGGTAGC CGAAGGCATG AAGGTTAATA CCCATTCCGC CAGAGTACGC
CGCGCCCGCC GCACCATACT TGAAATCCTG CTGGCCAATC ACCCGCAGGA CTGTTTTAAC
TGTGAGCGCA ACCAGAACTG TGATTTGCTG CGTCTGGCGT TTGAATGCGG AGTTAAAAAG
CTGCGCTTTG AAGAAAGCGA AAAGCGGGTG CTGCCTATAG ACAGTACCAG CCCCAGTATT
ATCCGTGACC CCAATAAATG TATTGCCTGC GGCCGGTGTG TCCGCGTCTG CCACGATATA
CAAACAGTCA ATGCCATCGG TTTTATAAAT AAAGGCCCGG ATACCATGGT GGCAACCTCC
ATGGACAGGG GTATGGGCAA TGTTGCCTGT GCCAACTGCG GCCAGTGCAT ACTGGTCTGC
CCGGTGGGTG CTATCAAGGA ACGCTCGGCG GTGGATGCTG TCTGGGCGGC TATAGCAGAC
CCCACTAAAC ACGTGGTTGT TCAGGAAGCT CCTTCGGTCA GGGTTTCTCT GGGCGAAGAG
CTGGGCTTGC CGGCAGGTAC GCTGGTTGCC AAAAAGATGT ATGCCGCTTT AAGGCGTCTG
GGTTTTGACG CCGTATTTGA TACCAACTTT ACCGCTGACC TGACCATTAT GGAAGAGGGT
TCGGAACTGG TGGAACGGGT TAAGGACGGC GGGGTGCTTC CCCAGATAAC CTCCTGCTGC
CCCGGCTGGG TCAAGTTTAT GGAGCATTAT TATCCTGAAC TTGCGCCCAA CGTTTCCTCC
GCCAAGTCCC CCCAGCAAAT GTTCGGGGCG GTCTGCAAGA CCTATTATGC CGAAAAATCC
GGCATAGACC CCAAGGATAT TATCAATGTT TCGGTCATGC CCTGTACGGC CAAGAAATTT
GAGTGCCAGC GTCCCGAAAT GAATGACAGC GGCTTTAAAG ACGTGGATTA TGTCTTGACT
ACCCGTGAGC TGGCCCGGAT GATAAAAGAA GCCGGACTGG ATTTTGCTTC ACTGGACGAA
GAGCCTGCCG AAGACTTGCT GGGTCTTTAT ACCGGTGCCG CCACTATCTT CGGGGCTACC
GGCGGTGTTA TGGAAGCGGC TATCCGCAGT GCCTACACCC TGATAACCGG GCGCGAACTG
GAAAACCTGG ATATAGAACC GGTGCGCGGT CTGGAAGGCA TTAAGACCGC CAGCGTTAAT
ATTGACGGGT TAGAGGTTAA AGTAGCGGTG GCTCACGGGC TGGGAAATGC CCGTCACCTG
CTGGATGAGA TAAAAGAGGG TGTTTCGCCC TACCACTTTA TAGAAATCAT GGCCTGTCCC
GGCGGTTGTG TCGGCGGCGG CGGCCAGCCA ATACGCTTTG ATTCCACTCT CAAGAAAAAG
CGCGGCGAAG CCCTTTACGA AGAAGACAGA AACATGGCCA AGAGGTGTTC CCACCACAAC
CCGTCAGTAG AGAAGATATA TAAAGACTAT CTGGAGAAGC CGCTGGGCAA GCGTTCTCAC
AAACTGCTGC ATACCGAATA TACCAGTCGC CCGGTAGTTT AA
 
Protein sequence
MVTLNIDNKQ ISVPEGTTIM QAAKQANINI PHLCYFEGLK SYSGCRVCVV EIEGEPRLAT 
SCSRKVAEGM KVNTHSARVR RARRTILEIL LANHPQDCFN CERNQNCDLL RLAFECGVKK
LRFEESEKRV LPIDSTSPSI IRDPNKCIAC GRCVRVCHDI QTVNAIGFIN KGPDTMVATS
MDRGMGNVAC ANCGQCILVC PVGAIKERSA VDAVWAAIAD PTKHVVVQEA PSVRVSLGEE
LGLPAGTLVA KKMYAALRRL GFDAVFDTNF TADLTIMEEG SELVERVKDG GVLPQITSCC
PGWVKFMEHY YPELAPNVSS AKSPQQMFGA VCKTYYAEKS GIDPKDIINV SVMPCTAKKF
ECQRPEMNDS GFKDVDYVLT TRELARMIKE AGLDFASLDE EPAEDLLGLY TGAATIFGAT
GGVMEAAIRS AYTLITGREL ENLDIEPVRG LEGIKTASVN IDGLEVKVAV AHGLGNARHL
LDEIKEGVSP YHFIEIMACP GGCVGGGGQP IRFDSTLKKK RGEALYEEDR NMAKRCSHHN
PSVEKIYKDY LEKPLGKRSH KLLHTEYTSR PVV