Gene Daro_3988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3988 
Symbol 
ID3567277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4283347 
End bp4285143 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content61% 
IMG OID637682461 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_287185 
Protein GI71909598 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCTT ACGAAACCCA AGGCTTCAAG GTTGATAACT CCGGCAAGCG CGTCGTCGTC 
GATCCGGTCT GCCGCATCGA GGGCCACCTG CGGGTGGAAG TGAATCTCGA CGACAAGAAC
GTCATCCGTA ATGCCGTGTC GACCGGCACC ATGTGGCGCG GTCTGGAAGT GATCCTCAAG
GGCCGCGATC CGCGCGACGC CTGGGCCTTT ACCGAACGCA TCTGCGGCGT CTGTACCGGC
ACCCATGCGC TGACCTCGGT GCGTGCCGTC GAGGATGCGC TGAAGATCCA GATCCCGGAG
AACGCCAACA CCATCCGCAA CCTGATGCAG CTGAACCTTT ACGTTCACGA CCATCTGGTG
CACTTCTATC ACCTGCACGC GCTGGACTGG GTCGATGTCG TGTCGGCGTT GAAGGCTGAT
CCGAAGGCGA CTTCCGCACT GGCCCAGAGC ATTTCTTCCT GGCCGCTGTC CTCGCCGGGT
TACTTCCGCG ACATCCAGAA TCGTCTGAAG AAATTTGTCG AATCCGGCCA GCTCGGCCCG
TTCATGAACG GCTACTGGGG CAACCCGGCC TACAAGCTGC CGCCGGAAGC CAACCTGATG
GCCGTGGCCC ACTACCTCGA AGCCCTCGAT TTCCAGAAGG AAATCGTTAA GGTGCACACC
ATTTTCGGCG GCAAGAATCC GCATCCGAAC TGGCTGGTTG GTGGCGTGCC CTGCGCGATC
AACCTCGAAG GCGTCGGTGC CGTCGGGGCG GTGAACATGG AGCGCCTGAA TCTGGTCAAG
AGCATCATTG ACCGCTGTGC CGAGTTCGTC GAACAGGTCT ACATCCCCGA CCTGCTGGCC
ATCGGCTCCT TCTACAAGGG CTGGCTGTAC GGTGGCGGTT TGTCGTCGAA GAACCTGCTG
TCATACGGCG ATATTCCACA GAAGGCCAAC GATTACACCT CGGGCAACCT GCTGCTGCCG
CGTGGTGCGA TCATCAATGG CAAGCTCGAT GAGATTCACC CGGTCGATCT GAAAGACCCG
GAACAGGTGC AGGAATTCGT CGCCCACTCC TGGTACAAGT ACCCGGACGA AACCAAGGGC
CTGCACCCGT TCGATGGCGT CACCGAACCC AGTTTCGTGC TCGGCCCGAA CACCAAGGGC
ACCAAGACCA ACATCAAGGA ACTCGACGAG GGCGGCAAGT ATTCGTGGAT CAAGGCGCCG
CGCTGGCGTG GTCACGCCAT GGAAGTCGGC TGCCTGCCGC GCATGGTGCT GGGCTACCTG
CAACCCAAGC AGTACCCGGA AATCCACGGT CTGGTCGACG GCGCGCTGAA GAAGCTTGAT
GTGCCGGTTA CTGCGCTGTT CTCGACGCTT GGTCGCACCG CGGCGCGTGG TCTGGAAACG
GCGTACTGCG TCAAGCTGCA GCAACAGCAG TTCGACAAGC TGATGACCAA CCTGAAGTCC
GGTGACCTGA ACACGGCTAA CATCGAGAAG TGGGAACCCA GCACCTGGCC GAAGGAAGCG
ATGGGTGCCG GTTTTACCGA AGCCCCGCGC GGTGCGCTCG GTCACTGGAT CCGCATCAAG
GACACCAAGA TCGACAACTA CCAGTGCGTC GTGCCGACCA CCTGGAACGG CGGTCCGCGT
GACCACAAGG GCCAGATCGG TGCCTTCGAG GCCTCGCTGA TGGATACCCC GGTGGCCAAG
GCCGATGAAC CGCTGGAAAT CCTGCGTACG CTGCATTCCT TCGATCCCTG CCTGGCCTGC
TCGACGCACG TGATGAGCCC GGACGGCCAG GAAATGACGT CGGTCAAGGT CCGCTAA
 
Protein sequence
MSAYETQGFK VDNSGKRVVV DPVCRIEGHL RVEVNLDDKN VIRNAVSTGT MWRGLEVILK 
GRDPRDAWAF TERICGVCTG THALTSVRAV EDALKIQIPE NANTIRNLMQ LNLYVHDHLV
HFYHLHALDW VDVVSALKAD PKATSALAQS ISSWPLSSPG YFRDIQNRLK KFVESGQLGP
FMNGYWGNPA YKLPPEANLM AVAHYLEALD FQKEIVKVHT IFGGKNPHPN WLVGGVPCAI
NLEGVGAVGA VNMERLNLVK SIIDRCAEFV EQVYIPDLLA IGSFYKGWLY GGGLSSKNLL
SYGDIPQKAN DYTSGNLLLP RGAIINGKLD EIHPVDLKDP EQVQEFVAHS WYKYPDETKG
LHPFDGVTEP SFVLGPNTKG TKTNIKELDE GGKYSWIKAP RWRGHAMEVG CLPRMVLGYL
QPKQYPEIHG LVDGALKKLD VPVTALFSTL GRTAARGLET AYCVKLQQQQ FDKLMTNLKS
GDLNTANIEK WEPSTWPKEA MGAGFTEAPR GALGHWIRIK DTKIDNYQCV VPTTWNGGPR
DHKGQIGAFE ASLMDTPVAK ADEPLEILRT LHSFDPCLAC STHVMSPDGQ EMTSVKVR