Gene Daro_3960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3960 
Symbol 
ID3567459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4257038 
End bp4258456 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content66% 
IMG OID637682433 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_287157 
Protein GI71909570 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCC TCATCGTCGG GCCGTTCAAC CGGGTCGAAG GCGATCTCGA AATCAGCCTC 
GATGTGGAAA ATGGACGCAT CCAGTCGGCT CAGGTCAATT CGCCGCTCTT TCGCGGTTTC
GAGCAGATCA TGGTTGGCCG GGCCCCGCTT GATGCATTGG CCATCGTGCC GCGCATCTGT
GGTATTTGCT CGGTGGCACA GTCGGCGGCA GCGGCTTCGG CGCTGGCCGA TGCGATGGGT
ATCGCCCCGA CGCCGAACGG GTTGCTCGCC CGCCATCTGA TCCAGTCGAC GGAAAATCTG
GCCGATCACC TGACGCACTT CTACCTCTTT TTCATGCCCG ACTTTGCCCG TCCGGCCTAT
GCCGGGCGCC ACTGGCATGG CGCTGTCGCC CAGCGCTTCG CGGCCGTCAA GGGTGATGCC
ACTGCCGAAG TCCTGTCGGC CCGCGCCAGC TTCCTCAAGC TGATGGGTTT TCTCGCCGGT
CGCTGGCCGC ACACCCTGGC CATTCAGCCC GGTGGCAGCA CGCGGGCGAT CACTGCTGGC
GAGCGTATTC GCCTGCTGGC GCTGCTCCGC GAATTCCGTA GCTTTCTCGA AAAACGCCTG
TTCGGCGATT CGTTGGCCAC GGTCTCCCAA CTGGCCAGCA AAGATCAATT GCTCGCCTGG
GCCGCCGGCC GTGCTGGTGA CTTCCCGGCG TTTCTGGATG CCGCTACCGA TCTTGGTCTC
GACCGGATGG GAAGCGCCTA CGATGCTTTC CTGTCCTATG GCGCCTACGA TCTCTTTCCC
GCCGGCACCT GGCAAGGCGG TCAAGCGGCT GCCTTCGATC CCGTGGCGAT CGATGAAGAC
ACGACCAGCG CCTGGCTGGC TGCCGGCCAG CCGCGCCATC CGGCGCAGGG TGAAACGGTT
GTCGATGCCG GCAAGCCGGC GGCCTACACC TGGTGCAAGG CACCGCGCTA CGCCAGCCAG
CCCTACGAAG TCGGTGCGTT GGCCAGGCAG GTCATCGCCG GTCACCCGCT GGCGTTGGAT
CTGGTACAAC GCGACGGCGC CAGCGTCATG GCTCGCGTCG TTGCACGCCT GCTCGAACTG
GCCCTTGTCC TGCCCGCCAT GGAAGGCTGG GTTCTGGCGC TACAACCCGG CGAAGCCTTC
TGCGCCCACG GCGATATGCC CGATGACGCC ACTGGCACCG GCCTTGTCGA AGCCGCCCGC
GGCAGCCTCG GCCACTGGCT GAGCATCAAG CGTGGCCGCA TCGAGCGCTA CCAGATCATC
GCCCCGACCA CCTGGAACTT CTCGCCGCGT GATGGCAACG CCCTGCCCGG TCCGCTCGAA
CAGGCACTGG TCGGCCTGCC GGCCGGAGAG GGCGCCCCGC CAACCGTGCA GCACGTCGTG
CGGTCGTTCG ATCCTTGCAT GGTCTGTACC GTGCATTAG
 
Protein sequence
MTRLIVGPFN RVEGDLEISL DVENGRIQSA QVNSPLFRGF EQIMVGRAPL DALAIVPRIC 
GICSVAQSAA AASALADAMG IAPTPNGLLA RHLIQSTENL ADHLTHFYLF FMPDFARPAY
AGRHWHGAVA QRFAAVKGDA TAEVLSARAS FLKLMGFLAG RWPHTLAIQP GGSTRAITAG
ERIRLLALLR EFRSFLEKRL FGDSLATVSQ LASKDQLLAW AAGRAGDFPA FLDAATDLGL
DRMGSAYDAF LSYGAYDLFP AGTWQGGQAA AFDPVAIDED TTSAWLAAGQ PRHPAQGETV
VDAGKPAAYT WCKAPRYASQ PYEVGALARQ VIAGHPLALD LVQRDGASVM ARVVARLLEL
ALVLPAMEGW VLALQPGEAF CAHGDMPDDA TGTGLVEAAR GSLGHWLSIK RGRIERYQII
APTTWNFSPR DGNALPGPLE QALVGLPAGE GAPPTVQHVV RSFDPCMVCT VH