Gene Ddes_1039 details

Gene Information

Locus tag: Ddes_1039
Symbol:
ID: 7284720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774
Kingdom: Bacteria
Replicon accession: NC_011883
Strand:
Start bp: 1215541
End bp: 1216488
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 58%
IMG OID: 643581840
Product: hydrogenase (NiFe) small subunit HydA
Protein accession: YP_002479624
Protein GI: 220904312
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
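
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 1215541
END_BP = 1216488
GENE_LENGTH_BP = 948
PROTEIN_LENGTH_AA = 315


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon, assuming a complete CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # 1216488 - 1215541 + 1 = 948 bp
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 948 / 3 = 316 codons, minus the stop codon = 315 aa
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```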


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.370707
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATTG CCGTGGGTCT GGGCAAAGAA GGCGGAGAAG AGCGTTTGGA GCGCCAGGGC 
ATCAGCCGCC GCGATTTCAT GAAATTTTGT ACAGCGGTGG CCGTGGCCAT GGGCATGGGA
CCGGCTTTTG CTACCGATGT GGCTGCCGCC CTTACTGGGC GTCGTCCTTC GGTGGTGTAC
CTGCACGCCG CCGAGTGCAC GGGCTGTTCC GAAGCGCTGT TGCGCACCTA TCAGCCCTTC
ATTGACACTC TTATTCTTGA CACCATTTCT CTTGACTATC ACGAAACCAT CATGGCCGCC
GCCGGTGAAG CTGCCGAAGA GGCCCTTCAG GCCGCCGTCA ACGGGCCTGA CGGTTTCATC
TGCCTTGTCG AAGGCGCTAT CCCCACGGGC ATGGACAACA AGTACGGCTA TATTGCCGGA
CACACCATGT ACGACATCTG CAAAAACATC CTGCCCAAGG CCAAGGCCGT TGTCAGCATC
GGCACCTGTG CCTGTTACGG CGGCATTCAG GCGGCCAAGC CCAATCCCAC CGCCGCCAAG
GGCATCAATG ACTGCTATGC GGATCTGGGC GTCAAGGCCA TCAACGTGCC TGGCTGCCCG
CCCAACCCCC TGAATATGGT TGGTACTCTT GTGGCTTTCC TCAAGGGCCA AAAAATCGAG
CTGGACGAAG TGGGCCGCCC CGTTATGTTC TTTGGACAGA GCGTGCATGA CCTCTGCGAA
CGCCGCAAGC ATTTCGACGC CGGTGAGTTT GCGCCTTCCT TCAACTCTGA AGAAGCGCGC
AAGGGCTGGT GCCTCTACGA TGTCGGCTGT AAAGGACCAG AGACCTACAA CAATTGTCCC
AAGGTTCTTT TCAACGAGAC CAACTGGCCG GTGGCTGCCG GGCATCCCTG CATAGGCTGC
AGTGAGCCCA ATTTCTGGGA CGATATGACG CCGTTCTACC AGAACTAG
 
Protein sequence
MRIAVGLGKE GGEERLERQG ISRRDFMKFC TAVAVAMGMG PAFATDVAAA LTGRRPSVVY 
LHAAECTGCS EALLRTYQPF IDTLILDTIS LDYHETIMAA AGEAAEEALQ AAVNGPDGFI
CLVEGAIPTG MDNKYGYIAG HTMYDICKNI LPKAKAVVSI GTCACYGGIQ AAKPNPTAAK
GINDCYADLG VKAINVPGCP PNPLNMVGTL VAFLKGQKIE LDEVGRPVMF FGQSVHDLCE
RRKHFDAGEF APSFNSEEAR KGWCLYDVGC KGPETYNNCP KVLFNETNWP VAAGHPCIGC
SEPNFWDDMT PFYQN
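
The protein sequence follows from the gene sequence by codon-by-codon translation. A minimal sketch is given below, assuming the sequence is read in frame from its first base; translation table 11 assigns the same amino acids to codons as the standard genetic code, so a single 64-entry table suffices here. The helper names `clean` and `translate` are hypothetical. Whitespace is stripped because the sequences in this record are printed in blocks of ten.

```python
# Codon table in the conventional T, C, A, G ordering.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA sequence; stop codons are returned as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGCGTATTG CCGTGGGTCT GGGCAAAGAA"))  # MRIAVGLGKE
```

Applied to the full gene sequence, the result should end in a `*` for the terminal TAG stop codon, which the listed protein sequence omits.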