Gene Daro_3971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3971 
Symbol 
ID3567470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4266838 
End bp4268550 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content62% 
IMG OID637682444 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_287168 
Protein GI71909581 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value0.533836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAC GTGTCACCAT CGACCCGGTC ACCCGTATTG AGGGCCACCT GCGCGTCGAC 
GTCGAAGTGG ATGGCGGCCG CGTCAAGAAG GCCTGGGCAT CGGGCCAGAT GTGGCGGGGC
GTCGAGAACA TCCTGATCGG CCGCGACCCG CGCGACGCCT GGGCGATCAC CCAGCGCATC
TGCGGCGTTT GCACCACCGT GCATGCCATG GCCTCGGTCC GTGCGGTCGA GAACGCGCTG
CAGTTGGAAA TCCCGGTCAA CGCCCAGTAC ATCCGCAACA TGATCATGCT GGCCCATGCC
GTGCATGACC ATATTGTCCA TTTCTATCAC CTCTCGGCAC TCGACTGGGT CGATGTCGTC
TCGGCGCTGA AGGCTGATCC GGCCAAGGCG GCCAGTCTGG CGCAGAGCCT GTCGAACTGG
AGCGGCAACA GTCAGGCCGA GTTCAAGAAG GTGCAGGATC GGCTGAGCGG CTTTGTTGGC
ACCGGACAGC TCGGTATTTT CACCAATGGT TATTGGGGGC ACCCGGCGAT GAAACTGTCG
CCCGAGGTCA ACCTGATGGC CACGGCGCAC TACCTGCAGG CCTTGGAAAT CCAGCGTTAC
GCCAGCAAGA TCGTCGCCGT ACTCGGTTCA AAATCGCCGC ACATCCAGAA TGTCGCGGTC
GGCGGCGTGG CCAATCCGCT GTCGGTCGAT TCGCAATCGG TGCTGACCAT CGAGCGCCTG
CTGGCGATCA AGGAGTGGAT GCTCAAGCTC GAAGACTTCG TCAAGAACGT CTATCTGGTC
GACGTCGGCG CCATCGGCGG CTTCTATGCC GACTGGACCA AGTACGGCAA GGGCATCACC
GACTACCTGT GCGTGCCGGA CATCCCGCTC GATGGCAAGG GCACGACTTT CGCGCTGCCC
GGCGGCTACA TCGAAGGCGG CAAGCTGGAG AGCTACAAGG CGATCACCAG CTTTAACGAC
AAATACTTCA CCGACGGTGT ATCCGAGGCG ATCAAGCATT CCTGGTACAA CTACAGCGAC
GGCAACGACA AGTCGCTGCA CCCGTACAAG GGTGAAACGA CGCCGAACTA CACCGATTTC
CAGGATGATG GAAAGTACTC GTGGCTGAAG TCGCCGACCT TCTATGGCAA GCCGATGCAG
GTCGGCCCAC TGCCGCGCGT GCTGAACATG CTGGCCGCCG GCCACGAACC GACCAAGAAG
TACGCGACCG CGGCGCTCGA TCTGGTGTCT TCGGTGGCCG GCACCAAGGT CGGCGTCGAA
GCGCTGCATT CGACGATCGG CCGCCATGCG GCGCGGGCTG TTGGCTGCGC TGTGCAGGTT
GATGAACTGA TCAACCAGTG GGATCTGCTG CTCGCCAACA TGGCCAAGGG CGACCTCAAG
ACCTTCAATC GCCCGGTCTT CCCAAAGGGC GAACAGATGG GCGTCGGTTT CCACGAAGCG
CCGCGCGGCG TGCTGTCGCA CTGGGTGGTC ATCGATTCCG GCAAGATCAA GAACTACCAG
TGCGTCGTGC CGACCACCTG GAATGCCGCG CCGCGCAACG AGAAGGACCA GCCCGGCGCC
TACGAGGCCT GCCTGATCGA CAACCCGGTG GCCGATCCCG AGAAGCCGCT GGAAGTCCTG
CGCACGGTGC ACTCCTTCGA CCCCTGCCTG GCCTGTGCCG TGCATGTGGT CGATCAGGAG
AACAATCCGG TGGTCACGGT GACGGCTGTT TGA
 
Protein sequence
MNKRVTIDPV TRIEGHLRVD VEVDGGRVKK AWASGQMWRG VENILIGRDP RDAWAITQRI 
CGVCTTVHAM ASVRAVENAL QLEIPVNAQY IRNMIMLAHA VHDHIVHFYH LSALDWVDVV
SALKADPAKA ASLAQSLSNW SGNSQAEFKK VQDRLSGFVG TGQLGIFTNG YWGHPAMKLS
PEVNLMATAH YLQALEIQRY ASKIVAVLGS KSPHIQNVAV GGVANPLSVD SQSVLTIERL
LAIKEWMLKL EDFVKNVYLV DVGAIGGFYA DWTKYGKGIT DYLCVPDIPL DGKGTTFALP
GGYIEGGKLE SYKAITSFND KYFTDGVSEA IKHSWYNYSD GNDKSLHPYK GETTPNYTDF
QDDGKYSWLK SPTFYGKPMQ VGPLPRVLNM LAAGHEPTKK YATAALDLVS SVAGTKVGVE
ALHSTIGRHA ARAVGCAVQV DELINQWDLL LANMAKGDLK TFNRPVFPKG EQMGVGFHEA
PRGVLSHWVV IDSGKIKNYQ CVVPTTWNAA PRNEKDQPGA YEACLIDNPV ADPEKPLEVL
RTVHSFDPCL ACAVHVVDQE NNPVVTVTAV