Gene Clim_2097 details


Gene Information

Locus tag: Clim_2097
Symbol:
ID: 6355075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2312955
End bp: 2314229
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 59%
IMG OID: 642669692
Product: nickel-dependent hydrogenase large subunit
Protein accession: YP_001944104
Protein GI: 189347575
COG category: [C] Energy production and conversion
COG ID: [COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit
TIGRFAM ID:
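
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The record does not state either assumption, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2312955
END_BP = 2314229


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1275
print(protein_length_aa(length))  # 424
```

Both results agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields.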


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGTG ACTGTTCGAT CGATATTCGC CATCTGACAA GGGTTGAAGG CCACGGAAAT 
ATCCGGATTA CGGTAAGCGG AGGAAAACTG CTGGAAGCCC GGTGGGCGGT TGTTGAAACC
CCGAGGTTTT TCGAGGTGAT GGTCAAAGGC ATGAGCGCCG AACGGGTGCC ATTTCTCACC
TCGCGCATCT GCGGCATCTG TTCGATCAGC CATGCCCTGG CGAGCATCAG GGCGCTCGAA
CGGGCTATGC TGATCGCCCC GCCTCCAGCT GCGGAAACAA CCAGGCTGCT TGCCATGCAC
GGAGAAACCC TGCAGAGCCA CGCGCTGCAC CTGTTTTTTC TTGCCGCACC GGATTTCGCC
GGCACGTCGG GTGTACTGCC TCTGCTGGAG TCGCAACCGG AACTGGTCAG GGCCGGTCTC
GGGCTCAAGG AACTCGGCAA CGAAATCAGC GCCGTAACAA CGGGACGGTG CACCCATCCG
GTCAGCCTCG TGGTGGGAGG GCTCAGCAAG GCGCCAGACA AAATTCGGCT GCAGCAGCTC
CTCGACATGA TCGGTGAACG GAAGTCCGCG CTCGGCATTG CCTGCGATTT CTTCGGTACC
CTCGATATTC CCCGGTTCGA GCGTGAAACC GAATTCATCT CGCTCCACAA CGGCGCAACC
TACCCCTTCA TCGGAGGCGA CCTGCTCTCC ACCGACGGCG TCAGGAAAGA AGAGAACGAC
TACCTCCCGA TGACGAACGA GTACGTCGCA GAATTCTCCA CCTCGAAGTT CACCCGGTGC
AGCCGCGAGT CATCGGCGGC GGGAGCGCTC GCACGCTTCA ACAACAACAG CGGATTCCTG
CACCCCGAAG CGAAAAAAGC CGCCGAAAAA CTGGGACTCA GGCCGATCTG CCACAACCCC
TTCATGTGCA ACATCACGCA GCTCGTCGAG TGCGTGCACA TCCTCTACGA CGCAGAAACG
CTCATCCAGA AATTGCTCGA CACCGACCTT TCCGATATCC GCACCCCGTT CGCCCCGAAG
GCAGGCATCG CAACGGGAGC CGTCGAGGCG CCCCGCGGCA TCCTCTACCA CCACATGGAA
ACCGATGAGG AGGGCAAGGT AGTGAAAGCC GACTGCATCA TTCCCACCAC GCAGAACAAC
GCCAATATCC ACAACGACCT GCAGGCCCTT GCCAGGCAGG CGTTCGAAGA GGGAAAAAAC
GACCGGGAGA TCGAAAAACT CGCCGAAATG CTGGTGCGCT CTTACGACCC CTGCATTTCA
TGCTCGGTGC ACTGA
 
Protein sequence
MKRDCSIDIR HLTRVEGHGN IRITVSGGKL LEARWAVVET PRFFEVMVKG MSAERVPFLT 
SRICGICSIS HALASIRALE RAMLIAPPPA AETTRLLAMH GETLQSHALH LFFLAAPDFA
GTSGVLPLLE SQPELVRAGL GLKELGNEIS AVTTGRCTHP VSLVVGGLSK APDKIRLQQL
LDMIGERKSA LGIACDFFGT LDIPRFERET EFISLHNGAT YPFIGGDLLS TDGVRKEEND
YLPMTNEYVA EFSTSKFTRC SRESSAAGAL ARFNNNSGFL HPEAKKAAEK LGLRPICHNP
FMCNITQLVE CVHILYDAET LIQKLLDTDL SDIRTPFAPK AGIATGAVEA PRGILYHHME
TDEEGKVVKA DCIIPTTQNN ANIHNDLQAL ARQAFEEGKN DREIEKLAEM LVRSYDPCIS
CSVH
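
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method. It strips the 10-base block spacing, computes the GC fraction, and translates codon by codon until the first stop codon. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares for elongation. Table 11 also allows alternative start codons, which this sketch does not handle, but this gene begins with ATG.

```python
from itertools import product

# Standard codon table built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_block = "ATGAAACGTG ACTGTTCGAT CGATATTCGC"
print(translate(first_block))  # MKRDCSIDIR
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the listed protein sequence and the GC content field.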