Gene Dhaf_3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_3271 
Symbol 
ID7260287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp3500910 
End bp3501860 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content53% 
IMG OID643563192 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_002459725 
Protein GI219669290 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones67 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACGAG ATGATCAGCC TATGGAAAGC GGATATCAGC GTCTCGCCAG AAGAGGGGTA 
TCCAGAAGGG ACTTTTTGAA ACTGTGCACA TTCACATGCG CGGCCCTGGG GATCGATTTG
AGCTTAGCGC CTCAAATAGC GGAGGCGGCT GAAGCAAATT TATCCAAAAA ACCGGTGATC
TGGATGCAAG GGCAAGGCTG CACCGGGTGC AGCGAATCTC TGTTATCTTC GGTTGACCCG
GGGCCGGAGG AGATTATTCT CGATCTTCTC TCGGTACGCT ACCATCCTAC CCTGATGGCA
GCCTCAGGAG AGCAGGCTGT TCAAAGCCTG GAAGACTGTA TAGCCCAAGG TCATTACATA
CTCGTCCTGG AGGGCTCGAT TCCCACAGCG GATTCACGGT ATTGCTTTGT GGAAGGAAAG
CCCTTTATAG AACAATTCAA ACTAGCTGCG GAAAAGGCTG AAGCGGTGAT CGCTGTGGGC
TCCTGTGCTT GTTACGGCGG GATTCCCCGT GCCGGGTTCA CCGGGGCAGC GGGCGCCCAG
GAGGTGCTCG AAGGAGTTAA GGTGGTCAAT CTCCCCAGTT GTCCGGTCAA ACCGGATCGG
CTGGTGGGTT TGCTTCTATA CTATCTGAGC CATAATGCTT TGCCTAAGCT GGACGGACTC
AACCGGCCGG AAGCTTATTA TCGCTACACC TTACATGACA GCTGTTATCG CCGCTGGCAT
TACGAAAAAG GAGAGTATCT GGAGGATTGG AATAACCCGG ACACTCTGGA TTGGTGCCTT
TATCACAAGG GTTGCAAAGG GCAGGATACT TACACGAACT GTGCTAATGC CTGGTGGAAT
GGCGGAGCCA ATTTCTGCGG CTATGCCGGA TCGCCCTGTG CGGGATGCAG TCAGCCTGAG
TATTATGACG GCTTCGCGCC CTTATTTGTG AACCCCAAGG AGGTGAAATA G
 
Protein sequence
MKRDDQPMES GYQRLARRGV SRRDFLKLCT FTCAALGIDL SLAPQIAEAA EANLSKKPVI 
WMQGQGCTGC SESLLSSVDP GPEEIILDLL SVRYHPTLMA ASGEQAVQSL EDCIAQGHYI
LVLEGSIPTA DSRYCFVEGK PFIEQFKLAA EKAEAVIAVG SCACYGGIPR AGFTGAAGAQ
EVLEGVKVVN LPSCPVKPDR LVGLLLYYLS HNALPKLDGL NRPEAYYRYT LHDSCYRRWH
YEKGEYLEDW NNPDTLDWCL YHKGCKGQDT YTNCANAWWN GGANFCGYAG SPCAGCSQPE
YYDGFAPLFV NPKEVK