Gene Dvul_0719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0719 
Symbol 
ID4662631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp886711 
End bp887685 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content64% 
IMG OID639818937 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_966169 
Protein GI120601769 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.534448 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTACA CGACACGCAT TCCGGCAAAA CTCCTTGCTC GACAGGCTGA AGACGGTCGC 
ATGACCCGCC GCGAGTTCAT GAAGTTCTGC GGCATCGTCG CCGTCGCCAT GGGCATGGGC
CCCGGATTCG CCCCTGCCGT CGCCGAGGCG CTTCAGGCAA AGGGGCGCCC CAGCGTGGTG
TACATGCATG GCGCAGAATG CACTGGCTGC ACCGAAGGTC TTCTCCGTTC CATCGACCCC
TTCATCGACA TCCTGATGAT GGAGGTCATC TCGCTGGACT ACTGCGAGAC GGTGATGGCG
GCAGCAGGCA GGGCCGCCCA TCATGCGCTG GAGGACGCCC TTCGCAACCC CGCGGGCTAC
GTCTGCACCA TCGAAGGTGC CATTCCCACC CGCAAGGGCG GGGTCTACGG GCAGGTCGGT
GGCGAGACCA TGCTCTCGCT GTTCAGCCGG GTGGCGAGCG GGGCCAAGGC TGTCATCGCC
ATGGGCACAT GTGCGAGCTT CGGCGGCATA CAGGCAGCCG CCCCCAACCC TTCGGGAGCC
ATCGGCGTAC GCGAAGCCCT TGCCCCGTTC GGCATCCAGC CCATCAACAT CGCAGGATGC
CCCCCCAACC CGGTGAACTA CATAGGTACC GTCGTCCATC TGCTCACCAA GGGCATGCCC
GAACTCGACA GTGTCGGTAG GCCGAAGATG TTCTACGGCA CGACCGTGCA CGACCAGTGT
GAAAGACGGA AGCACTTCAA CGCCGGCGAG TTCGCCCCCG GCTTCGAATC GAAGGAGGCA
CGTGAAGGCT GGTGCCTGCA CAAGCTGGGA TGTCGAGGGC CCTACACCTA CAACAACTGC
CCGACCGCCC AGTTCAATCA GGTCAACTGG CCGGTCAGGG CTGGAGCCCC TTGCATTGGC
TGCAGCGAAC CCGGCTTCTG GGACGCGCTG GCCCCCTTCA ACAAAGATGT CCGCCAGAAG
AGCGACAAGG CCTAA
 
Protein sequence
MSYTTRIPAK LLARQAEDGR MTRREFMKFC GIVAVAMGMG PGFAPAVAEA LQAKGRPSVV 
YMHGAECTGC TEGLLRSIDP FIDILMMEVI SLDYCETVMA AAGRAAHHAL EDALRNPAGY
VCTIEGAIPT RKGGVYGQVG GETMLSLFSR VASGAKAVIA MGTCASFGGI QAAAPNPSGA
IGVREALAPF GIQPINIAGC PPNPVNYIGT VVHLLTKGMP ELDSVGRPKM FYGTTVHDQC
ERRKHFNAGE FAPGFESKEA REGWCLHKLG CRGPYTYNNC PTAQFNQVNW PVRAGAPCIG
CSEPGFWDAL APFNKDVRQK SDKA