Gene Dbac_1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_1970 
Symbol 
ID8377643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp2264138 
End bp2265085 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content55% 
IMG OID645001195 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_003158474 
Protein GI256829746 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTTA GCAGACGTGA GTTCGTAAAA CTGTGCTCCG CAGGTGTCGC CGGATTGGGA 
ATTTCCCAGA TTTATCATCC GGGCATCGTG CACGCCATGA CCGAAGGAGC CAAAAAAGCT
CCGGTCATCT GGGTACAGGG ACAGGGTTGT ACTGGTTGCT CCGTTTCTCT GCTCAACGCA
GTCCATCCCA GAATCAAGGA GATTCTGCTG GATGTGATCA GCCTTGAGTT CCATCCCACC
GTCATGGCAA GTGAAGGTGA GATGGCATTG GCGCATATGT ACGAAATTGC TGAAAAGTTT
AACGGCAACT TTTTCTTGCT GGTGGAAGGT GCCATCCCCA CCGCCAAGGA AGGTCGCTAC
TGCGTTGTCG GTGAAACTCT GGATGCCAAA GGGCATCATC ATGAAATCAC CATGATGGAA
CTGATCCGGG ATCTGGCACC CAAGTCTCTG GCCACCGTGG CCATAGGTAC TTGTGCCGCT
TACGGCGGCA TTCCCGCGGC TGCAGGCAAC GTCACCGGCT CCAAGAGCGT GCGTGACTTC
TTTGCCGAAG AGAAGATCGA AAAACTGCTG GTCAACGTGC CCGGATGTCC GCCCCATCCG
GACTGGATGG TCGGCACTCT GGTTGCCGCA TGGAGCCATG TCCTCAATCC GACCGAGCAT
CCCCTGCCCG AATTGGATGA TGACGGCCGC CCGCTGCTGT TCTTTGGCGA CAACATCCAC
GAGAACTGTC CGTATCTTGA TAAATACGAC AACTCCGAAT TCGCGGAAAC CTTCACCAAG
CCGGGCTGCA AGGCCGAACT TGGCTGCAAG GGTCCGTCCA CCTATGCCGA TTGCGCCAAG
CGTCGCTGGA ACAACGGCAT AAACTGGTGT GTCGAGAACG CCGTGTGTAT CGGCTGTGTG
GAACCGGACT TTCCGGACGG AAAGTCTCCT TTCTATGTAG CGGAATAA
 
Protein sequence
MSLSRREFVK LCSAGVAGLG ISQIYHPGIV HAMTEGAKKA PVIWVQGQGC TGCSVSLLNA 
VHPRIKEILL DVISLEFHPT VMASEGEMAL AHMYEIAEKF NGNFFLLVEG AIPTAKEGRY
CVVGETLDAK GHHHEITMME LIRDLAPKSL ATVAIGTCAA YGGIPAAAGN VTGSKSVRDF
FAEEKIEKLL VNVPGCPPHP DWMVGTLVAA WSHVLNPTEH PLPELDDDGR PLLFFGDNIH
ENCPYLDKYD NSEFAETFTK PGCKAELGCK GPSTYADCAK RRWNNGINWC VENAVCIGCV
EPDFPDGKSP FYVAE