Gene Dvul_1384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1384 
Symbol 
ID4664870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1684199 
End bp1686019 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content64% 
IMG OID639819614 
Producthydrogenases, Fe-only 
Protein accessionYP_966829 
Protein GI120602429 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCGT TCATCAATGG CAAGGAAGTC CGGTGTGAAC CGGGCAGGAC GATACTTGAG 
GCCGCACGCG AGAACGGGCA CTTCATTCCC ACGTTGTGCG AACTTGCCGA CATCGGTCAT
GCACCCGGGA CGTGCCGGGT CTGTCTGGTC GAGATATGGC GTGACAAGGA GGCCGGGCCG
CAGATTGTCA CCTCCTGTAC GACCCCCGTC GAGGAGGGAA TGCGCATCTT CACGCGTACC
CCTGAAGTAC GCAGGATGCA GCGGCTACAG GTCGAACTGC TGCTGGCCGA CCATGACCAT
GACTGCGCAG CCTGCGCCCG TCATGGAGAC TGCGAGTTGC AGGATGTGGC ACAATTCGTG
GGTCTTACCG GTACGCGTCA CCATTTTCCG GATTATGCCC GCAGCCGCAC CCGTGATGTC
TCTTCGCCGT CCGTCGTGCG CGACATGGGC AAGTGCATCA GGTGCCTGCG CTGTGTCGCC
GTGTGCCGCA ACGTGCAGGG CGTCGATGCC CTCGTGGTGA CGGGAAACGG CATCGGCACC
GAAATCGGGC TGCGGCACAA TCGTAGCCAG AGTGCGTCCG ACTGTGTGGG CTGTGGCCAG
TGCACATTGG TCTGCCCTGT GGGGGCATTG GCTGGACGGG ACGACGTGGA GCGTGTCATC
GACTATCTCT ACGACCCCGA AATCGTCACC GTGTTCCAGT TCGCCCCGGC GGTGCGGGTG
GGCCTCGGTG AGGAGTTCGG GCTGCCTCCC GGTTCAAGCG TGGAAGGGCA GGTGCCCACG
GCCTTGCGCC TTCTCGGGGC AGACGTGGTA CTCGATACCA ACTTCGCAGC CGACCTCGTC
ATCATGGAGG AGGGCACCGA ACTCCTGCAA CGTCTTCGGG GCGGGGCGAA GCTGCCGCTC
TTCACCTCCT GCTGCCCCGG CTGGGTGAAT TTCGCCGAGA AGCACCTCCC CGACATCCTG
CCGCATGTCT CGACCACACG CTCGCCTCAG CAGTGCCTTG GCGCATTGGC CAAGACCTAT
CTTGCGCGCA CCATGAACGT CGCACCGGAG AGGATGCGCG TCGTATCGTT GATGCCCTGC
ACGGCGAAGA AGGAAGAGGC CGCACGGCCC GAATTCAGGC GCGACGGTGT CCGGGATGTG
GACGCAGTGC TCACCACGCG TGAGTTCGCC CGTCTTCTCC GGCGTGAGGG CATAGACCTC
GCCGGACTCG AACCCTCGCC CTGCGACGAC CCCCTGATGG GGCGGGCAAC CGGGGCGGCT
GTCATCTTCG GTACGACAGG CGGGGTGATG GAGGCGGCAC TGCGTACGGT CTACCATGTG
CTGAACGGCA AGGAACTCGC CCCAGTAGAA CTGCATGCCC TGCGCGGATA CGAGAACGTG
CGTGAGGCTG TCGTCCCGCT TGGTGAGGGT AACGGTTCCG TGAAGGTCGC CGTGGTGCAT
GGGCTCAAGG CTGCCCGGCA GATGGTCGAG GCGGTTCTTG CAGGGAAGGC CGACCATGTG
TTCGTGGAGG TCATGGCATG CCCGGGTGGA TGCATGGACG GAGGCGGTCA GCCGAGGTCG
AAGCGCGCCC ACAACCCCAA CGCGCAGGCG CGACGTGCCG CCCTTTTCTC GCTCGATGCG
GAAAACGCAC TGCGGCAGTC GCACAACAAT CCGCTCATCG GCAAGGTCTA CGAATCATTC
CTTGGCGAGC CCTGTTCGAA TTTGTCTCAC CGTCTGCTGC ACACCCGGTA TGGCGACCGC
AAGAGCGAAG TCGCCTACAC CATGCGCGAC ATCTGGCATG AGATGACCCT TGGCAGGCGG
GTACGGGGCG ACTCTGATTG A
 
Protein sequence
MNAFINGKEV RCEPGRTILE AARENGHFIP TLCELADIGH APGTCRVCLV EIWRDKEAGP 
QIVTSCTTPV EEGMRIFTRT PEVRRMQRLQ VELLLADHDH DCAACARHGD CELQDVAQFV
GLTGTRHHFP DYARSRTRDV SSPSVVRDMG KCIRCLRCVA VCRNVQGVDA LVVTGNGIGT
EIGLRHNRSQ SASDCVGCGQ CTLVCPVGAL AGRDDVERVI DYLYDPEIVT VFQFAPAVRV
GLGEEFGLPP GSSVEGQVPT ALRLLGADVV LDTNFAADLV IMEEGTELLQ RLRGGAKLPL
FTSCCPGWVN FAEKHLPDIL PHVSTTRSPQ QCLGALAKTY LARTMNVAPE RMRVVSLMPC
TAKKEEAARP EFRRDGVRDV DAVLTTREFA RLLRREGIDL AGLEPSPCDD PLMGRATGAA
VIFGTTGGVM EAALRTVYHV LNGKELAPVE LHALRGYENV REAVVPLGEG NGSVKVAVVH
GLKAARQMVE AVLAGKADHV FVEVMACPGG CMDGGGQPRS KRAHNPNAQA RRAALFSLDA
ENALRQSHNN PLIGKVYESF LGEPCSNLSH RLLHTRYGDR KSEVAYTMRD IWHEMTLGRR
VRGDSD