Gene Daro_3961 details

Gene Information

Locus tag: Daro_3961
Symbol:
ID: 3567460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 4258453
End bp: 4259454
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 63%
IMG OID: 637682434
Product: uptake hydrogenase accessory protein hupU
Protein accession: YP_287158
Protein GI: 71909571
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
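
The coordinates and lengths in this record are consistent with each other, which a short Python sketch can confirm. It assumes the start and end positions are inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4258453
end_bp = 4259454

# Assumption: coordinates are inclusive, so the span includes both end points.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```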


Plasmid Coverage information

Num covering plasmid clones: 51
Plasmid unclonability p-value: 0.462451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTGC TTTGGCTGCA GAGCGGCGGC TGCGGCGGTT GCACGCAGTC CATGCTGTGC 
TCCGAGCCTC GTTCGCTGTT CGACGAACTG CGCGACGCCG GTATCGAGTT TCTCTGGCAC
CCGGCGCTGT CGGTCGAAAG CGGCGAGGAA GCACTAAGCA TTCTCGAAGA CTGTGCCGAA
GGTCGCTTGG CCTTCGATGC GCTGTGTATC GAAGGTGCCA TGCTGCGCGG TCCGAATGGA
ACCGGCAAAT TTCACCTGAT GGCCGGTAGC GGCCGGCCGC TGACTGAATG GGTCGAGCGT
CTGGCCCGTC ACGCCAAATG GGTCTTTGCG ATTGGCTCGT GCACAGCCTA TGGCGGCTTT
TCGGCCAACA CGCCGGGCAA TCCGCTGGAA GCCTGTGGCC TGCAATTCGA CGAACAGACG
CCAGGTGGCC TGCTCGGCGC TGGCTTCCAG TCCTCCGCCG AGCTGCCGGT GATCAACATT
GCCGGTTGCC CGACGCATCC CGGCTGGGTG GTCGATACGC TGGAAAAGGC TGCGCTGGAA
GGTATAAGGG CCGATGACCT TGATGAATTC GGCCGGCCAT TGCTTTACGC CGGTGGCCTG
GTGCACCACG GTTGCGCTCG CAACGAATAT TACGAATTCA AGGCCAGTGC CGAGAAGCAG
TCCGATCTTG GCTGTCTGAT GGAAAACCTT GGCTGCAAGG GCACCCAGGC TCACGCCGAC
TGCAACCTGC GGCCGTGGAA CGGCAGCGGC TCCTGCCTGC GTGGCGGCTT TGCTTGCATA
GCCTGCACCG AGCCGGGTTT CGAATCGCCC GGTCATGCCT TCCAGGAAAC GCCCAAGCTG
GCCGGCATCC CGATCGGCCT GCCGACCGAC ATGCCGAAAG CCTGGTTCGT CGCGCTGGCT
GCGCTCTCCA AGTCGGCCAC CCCCAAGCGG GTGCGCAACA ACTCGGTGGC TGACCACCCG
GTGGTTCTTC CGGCGATTCG CAAGAAGGGC GGCGGCAAAT GA
 
Protein sequence
MKVLWLQSGG CGGCTQSMLC SEPRSLFDEL RDAGIEFLWH PALSVESGEE ALSILEDCAE 
GRLAFDALCI EGAMLRGPNG TGKFHLMAGS GRPLTEWVER LARHAKWVFA IGSCTAYGGF
SANTPGNPLE ACGLQFDEQT PGGLLGAGFQ SSAELPVINI AGCPTHPGWV VDTLEKAALE
GIRADDLDEF GRPLLYAGGL VHHGCARNEY YEFKASAEKQ SDLGCLMENL GCKGTQAHAD
CNLRPWNGSG SCLRGGFACI ACTEPGFESP GHAFQETPKL AGIPIGLPTD MPKAWFVALA
ALSKSATPKR VRNNSVADHP VVLPAIRKKG GGK
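
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal Python sketch of that translation, applied only to the first and last rows of the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons); the helper name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_row = "ATGAAAGTGC TTTGGCTGCA GAGCGGCGGC"
# Final row of the gene sequence; it begins at base 961, so it is in frame.
last_row = "GTGGTTCTTC CGGCGATTCG CAAGAAGGGC GGCGGCAAAT GA"

print(translate(first_row))
print(translate(last_row))
```

Under these assumptions, the two results should match the first ten and last thirteen residues of the protein sequence.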