Gene Information |
Locus tag | Dvul_1384 |
Symbol | |
ID | 4664870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1684199 |
End bp | 1686019 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639819614 |
Product | hydrogenases, Fe-only |
Protein accession | YP_966829 |
Protein GI | 120602429 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | [TIGR02512] hydrogenases, Fe-only |
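
The length fields above can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Dvul_1384.
length = gene_length_bp(1684199, 1686019)
print(length, protein_length_aa(length))
```

Under these assumptions the coordinates reproduce the listed gene length of 1821 bp and protein length of 606 aa.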
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCGT TCATCAATGG CAAGGAAGTC CGGTGTGAAC CGGGCAGGAC GATACTTGAG GCCGCACGCG AGAACGGGCA CTTCATTCCC ACGTTGTGCG AACTTGCCGA CATCGGTCAT GCACCCGGGA CGTGCCGGGT CTGTCTGGTC GAGATATGGC GTGACAAGGA GGCCGGGCCG CAGATTGTCA CCTCCTGTAC GACCCCCGTC GAGGAGGGAA TGCGCATCTT CACGCGTACC CCTGAAGTAC GCAGGATGCA GCGGCTACAG GTCGAACTGC TGCTGGCCGA CCATGACCAT GACTGCGCAG CCTGCGCCCG TCATGGAGAC TGCGAGTTGC AGGATGTGGC ACAATTCGTG GGTCTTACCG GTACGCGTCA CCATTTTCCG GATTATGCCC GCAGCCGCAC CCGTGATGTC TCTTCGCCGT CCGTCGTGCG CGACATGGGC AAGTGCATCA GGTGCCTGCG CTGTGTCGCC GTGTGCCGCA ACGTGCAGGG CGTCGATGCC CTCGTGGTGA CGGGAAACGG CATCGGCACC GAAATCGGGC TGCGGCACAA TCGTAGCCAG AGTGCGTCCG ACTGTGTGGG CTGTGGCCAG TGCACATTGG TCTGCCCTGT GGGGGCATTG GCTGGACGGG ACGACGTGGA GCGTGTCATC GACTATCTCT ACGACCCCGA AATCGTCACC GTGTTCCAGT TCGCCCCGGC GGTGCGGGTG GGCCTCGGTG AGGAGTTCGG GCTGCCTCCC GGTTCAAGCG TGGAAGGGCA GGTGCCCACG GCCTTGCGCC TTCTCGGGGC AGACGTGGTA CTCGATACCA ACTTCGCAGC CGACCTCGTC ATCATGGAGG AGGGCACCGA ACTCCTGCAA CGTCTTCGGG GCGGGGCGAA GCTGCCGCTC TTCACCTCCT GCTGCCCCGG CTGGGTGAAT TTCGCCGAGA AGCACCTCCC CGACATCCTG CCGCATGTCT CGACCACACG CTCGCCTCAG CAGTGCCTTG GCGCATTGGC CAAGACCTAT CTTGCGCGCA CCATGAACGT CGCACCGGAG AGGATGCGCG TCGTATCGTT GATGCCCTGC ACGGCGAAGA AGGAAGAGGC CGCACGGCCC GAATTCAGGC GCGACGGTGT CCGGGATGTG GACGCAGTGC TCACCACGCG TGAGTTCGCC CGTCTTCTCC GGCGTGAGGG CATAGACCTC GCCGGACTCG AACCCTCGCC CTGCGACGAC CCCCTGATGG GGCGGGCAAC CGGGGCGGCT GTCATCTTCG GTACGACAGG CGGGGTGATG GAGGCGGCAC TGCGTACGGT CTACCATGTG CTGAACGGCA AGGAACTCGC CCCAGTAGAA CTGCATGCCC TGCGCGGATA CGAGAACGTG CGTGAGGCTG TCGTCCCGCT TGGTGAGGGT AACGGTTCCG TGAAGGTCGC CGTGGTGCAT GGGCTCAAGG CTGCCCGGCA GATGGTCGAG GCGGTTCTTG CAGGGAAGGC CGACCATGTG TTCGTGGAGG TCATGGCATG CCCGGGTGGA TGCATGGACG GAGGCGGTCA GCCGAGGTCG AAGCGCGCCC ACAACCCCAA CGCGCAGGCG CGACGTGCCG CCCTTTTCTC GCTCGATGCG GAAAACGCAC TGCGGCAGTC GCACAACAAT CCGCTCATCG GCAAGGTCTA CGAATCATTC CTTGGCGAGC CCTGTTCGAA TTTGTCTCAC CGTCTGCTGC ACACCCGGTA TGGCGACCGC AAGAGCGAAG TCGCCTACAC CATGCGCGAC ATCTGGCATG AGATGACCCT TGGCAGGCGG 
GTACGGGGCG ACTCTGATTG A
|
Protein sequence | MNAFINGKEV RCEPGRTILE AARENGHFIP TLCELADIGH APGTCRVCLV EIWRDKEAGP QIVTSCTTPV EEGMRIFTRT PEVRRMQRLQ VELLLADHDH DCAACARHGD CELQDVAQFV GLTGTRHHFP DYARSRTRDV SSPSVVRDMG KCIRCLRCVA VCRNVQGVDA LVVTGNGIGT EIGLRHNRSQ SASDCVGCGQ CTLVCPVGAL AGRDDVERVI DYLYDPEIVT VFQFAPAVRV GLGEEFGLPP GSSVEGQVPT ALRLLGADVV LDTNFAADLV IMEEGTELLQ RLRGGAKLPL FTSCCPGWVN FAEKHLPDIL PHVSTTRSPQ QCLGALAKTY LARTMNVAPE RMRVVSLMPC TAKKEEAARP EFRRDGVRDV DAVLTTREFA RLLRREGIDL AGLEPSPCDD PLMGRATGAA VIFGTTGGVM EAALRTVYHV LNGKELAPVE LHALRGYENV REAVVPLGEG NGSVKVAVVH GLKAARQMVE AVLAGKADHV FVEVMACPGG CMDGGGQPRS KRAHNPNAQA RRAALFSLDA ENALRQSHNN PLIGKVYESF LGEPCSNLSH RLLHTRYGDR KSEVAYTMRD IWHEMTLGRR VRGDSD
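
The sequences above are printed in blocks of ten characters and may wrap across lines, so they need cleaning before use. The following is a minimal sketch, assuming plain-Python processing with no external libraries; the helper names are hypothetical. It strips the whitespace, computes the GC percentage, and checks that a CDS starts with ATG and ends with one of the three stop codons of translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def has_atg_start_and_stop(seq: str) -> bool:
    """True if the CDS begins with ATG, ends with a stop codon, and is in frame."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Short fragment in the same block layout as the record, for illustration only.
fragment = clean_sequence("ATGAACGCGT TCATCAATGG")
print(fragment, round(gc_percent(fragment), 1))
```

Applying `clean_sequence` to the full gene sequence field, including its wrapped final line, should give a string whose length can be compared with the listed gene length and whose GC percentage can be compared with the listed GC content.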