Gene Dvul_1386 details


Gene Information

Locus tag: Dvul_1386
Symbol:
ID: 4664869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 1686655
End bp: 1687920
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 63%
IMG OID: 639819616
Product: hydrogenases, Fe-only
Protein accession: YP_966831
Protein GI: 120602431
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR02512] hydrogenases, Fe-only
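
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; neither is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1686655
END_BP = 1687920
LISTED_GENE_LENGTH_BP = 1266
LISTED_PROTEIN_LENGTH_AA = 421


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, protein_length(length_bp))  # prints "1266 421"
```

Under these assumptions, 1687920 - 1686655 + 1 = 1266 bp, which is 422 codons, or 421 residues once the stop codon is excluded.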


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.999103
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGTA CCGTCATGGA GCGCATCGAA TATGAGATGC ACACTCCGGA CCCCAAGGCC 
GATCCGGACA AGCTCCACTT CGTCCAGATC GACGAGGCAA AATGCATAGG CTGCGACACC
TGTTCGCAGT ACTGCCCCAC CGCCGCCATC TTCGGCGAAA TGGGCGAACC GCACTCCATT
CCCCACATCG AGGCGTGCAT CAACTGCGGC CAGTGCCTCA CGCACTGCCC CGAGAACGCC
ATCTACGAGG CACAGTCGTG GGTGCCTGAA GTCGAGAAGA AGCTGAAGGA CGGCAAGGTG
AAATGCATCG CCATGCCCGC CCCCGCCGTG CGCTATGCGC TGGGCGACGC CTTCGGCATG
CCCGTCGGTT CCGTCACCAC CGGCAAGATG CTCGCGGCCC TGCAGAAGCT CGGCTTCGCC
CATTGCTGGG ACACCGAGTT CACCGCTGAC GTGACCATAT GGGAAGAGGG TTCCGAGTTC
GTGGAACGCC TCACCAAGAA GAGCGACATG CCGCTGCCGC AGTTCACCTC GTGCTGCCCC
GGCTGGCAGA AGTATGCCGA GACCTACTAC CCCGAACTGC TGCCGCACTT CTCCACGTGC
AAGTCGCCCA TCGGCATGAA CGGCGCACTG GCGAAGACCT ACGGCGCAGA GCGGATGAAG
TACGACCCCA AGCAGGTCTA CACCGTCTCC ATCATGCCCT GCATCGCCAA GAAGTACGAA
GGTTTGCGGC CCGAACTGAA GTCCAGCGGC ATGCGCGACA TCGACGCCAC GCTGACCACC
CGTGAGCTGG CCTACATGAT CAAGAAGGCC GGTATCGACT TCGCGAAACT CCCCGACGGC
AAGCGTGACA GCCTCATGGG TGAATCCACC GGCGGTGCCA CCATCTTCGG CGTCACCGGC
GGCGTCATGG AAGCAGCACT CCGCTTCGCC TACGAAGCCG TCACCGGCAA GAAGCCCGAC
AGCTGGGACT TCAAGGCCGT GCGCGGTCTC GATGGCATCA AGGAAGCCAC CGTCAACGTC
GGCGGTACCG ACGTCAAGGT CGCCGTGGTA CACGGGGCCA AGCGGTTCAA GCAGGTCTGC
GACGATGTGA AGGCGGGCAA GTCGCCCTAT CACTTCATCG AATACATGGC CTGCCCCGGC
GGCTGCGTCT GTGGCGGCGG TCAGCCCGTC ATGCCCGGCG TGCTCGAAGC CATGGACCGC
ACCACCACCC GCCTTTACGC GGGCCTGAAG AAGCGCCTCG CCATGGCGAG CGCCAACAAG
GCATAG
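
A GC fraction can be recomputed from the gene sequence and compared with the listed GC content. This is a minimal sketch: `gc_fraction` is a hypothetical helper, and it assumes the sequence is pasted in the 10-base block layout used here, so whitespace is stripped first.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence: 11 G/C out of 20 bases.
    first_blocks = "ATGAGCCGTA CCGTCATGGA"
    print(f"{gc_fraction(first_blocks):.2f}")  # prints "0.55"
```

Passing the full gene sequence to the same function gives a value that can be checked against the GC content field.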
 
Protein sequence
MSRTVMERIE YEMHTPDPKA DPDKLHFVQI DEAKCIGCDT CSQYCPTAAI FGEMGEPHSI 
PHIEACINCG QCLTHCPENA IYEAQSWVPE VEKKLKDGKV KCIAMPAPAV RYALGDAFGM
PVGSVTTGKM LAALQKLGFA HCWDTEFTAD VTIWEEGSEF VERLTKKSDM PLPQFTSCCP
GWQKYAETYY PELLPHFSTC KSPIGMNGAL AKTYGAERMK YDPKQVYTVS IMPCIAKKYE
GLRPELKSSG MRDIDATLTT RELAYMIKKA GIDFAKLPDG KRDSLMGEST GGATIFGVTG
GVMEAALRFA YEAVTGKKPD SWDFKAVRGL DGIKEATVNV GGTDVKVAVV HGAKRFKQVC
DDVKAGKSPY HFIEYMACPG GCVCGGGQPV MPGVLEAMDR TTTRLYAGLK KRLAMASANK
A
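
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is not the database's own pipeline: it builds the codon table by hand and assumes that translation table 11 assigns amino acids exactly as the standard code does, differing only in which start codons are permitted. That distinction does not matter here because the gene begins with ATG. `translate` is an illustrative name.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        # Codons with ambiguous bases are reported as X.
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = ("ATGAGCCGTA CCGTCATGGA GCGCATCGAA "
                  "TATGAGATGC ACACTCCGGA CCCCAAGGCC")
    print(translate(first_line))  # prints "MSRTVMERIEYEMHTPDPKA"
```

The first 60 bases of the gene translate to the first 20 residues of the protein sequence, and the final six bases (GCATAG) give the terminal alanine followed by a stop codon.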