Gene BBta_2005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_2005 
SymbolhupK 
ID5150434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp2070104 
End bp2071210 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content67% 
IMG OID640556946 
Producthydrogenase expression/formation 
Protein accessionYP_001238102 
Protein GI148253517 
COG category[C] Energy production and conversion 
COG ID[COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.432728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGG CGTTTCGCAA CCACATCGAT GTCACGCTGT CGGTTGCAGC TCAGATGATC 
GTCGGTGTCA CGATCGAACC GCGCTCGCGG CCGCCGCTCG GTCGGCTGTT TGCCGGCAAG
CCGGCCGAGA CGCTGCTGGC GGCGTTACCG CGGCTGTTCT CATTATGCGC CATCGCGCAT
CAGGTCGCCC TTTTGTCTGC GCTCGAAGCT GCACGTGGCC ATCAGGCGCC GCCGCTGACG
CGGCATCGGC GCATCACCGC CGTGATCATG GAGCGGTTCG CCGAGTTGCT GCGTGGCGTT
CTGGTCAGCC GTCTCGCTTG CGATCGCAGC GCCCTGGCGC AGCTGCAGTT TCTGCTGCAG
GCGGTGGCCT CGCTCCAGGT GTCCGCGGGC GCCGGAAACG CGCGGCAATC TCGCAGCGCA
ACCTTGTCGC AGATCAAGAT GGCCCTGGCC GCCCTGGGTT TAGGATCGGT TGCGGAACCC
GTGGTGCGGG GAACTCCGCT TGCGTCGATC ATGGATGCGG CCCGCAGGGC CGAAGCCGAT
GGAGGATGGA AGCAGATGCC GGCCGAGCAC GGTGTCCTGT CGGCCGCCGA CGATGACACC
GTCGTGGCTC GACTGATCGA TCCGCAGGCC GCCTTCGCCG AGGCGCCCGA ACTTGCCGGG
CGTGTTCCGG AAACCGGAGT CTGGGCGCGG CAAGCATCGC GTCATCGGCA TTCATCCGCG
GGGTCCGTCG AGCGGCTCTT GGCAAAGTTG GCCGAACTCG CCGAGCTGCT GTGCTGGATC
GAAGCCGGCG AGGCCGAAGA TGAAGCGGCC GACCAGGACG TCGTTGCAAG CTATGCGCTC
GGCCCAGGGC GCGGTGCGGC GGCCGTCGAA TGCGCGCGCG GCCGGCTGCA TCACGCGATC
GAGCTGGATG CGGAAGGCCG TGTCCGCCGG TTCGAATTTC TCGCGCCGAC CGAATGGAAT
TTTCATCCCC GCGGCCCGGT TGCCGGCAGC CTGACTGGGG CTCGGCTTTG CGGCTCCGCC
GATCGTGCCG CGATCGAGGC GATGATCAGT TCCTTCGATC CCTGCGTCGG CTACAGTCTC
GCGGTACGGG AGATGGCTGA TGCATGA
 
Protein sequence
MTLAFRNHID VTLSVAAQMI VGVTIEPRSR PPLGRLFAGK PAETLLAALP RLFSLCAIAH 
QVALLSALEA ARGHQAPPLT RHRRITAVIM ERFAELLRGV LVSRLACDRS ALAQLQFLLQ
AVASLQVSAG AGNARQSRSA TLSQIKMALA ALGLGSVAEP VVRGTPLASI MDAARRAEAD
GGWKQMPAEH GVLSAADDDT VVARLIDPQA AFAEAPELAG RVPETGVWAR QASRHRHSSA
GSVERLLAKL AELAELLCWI EAGEAEDEAA DQDVVASYAL GPGRGAAAVE CARGRLHHAI
ELDAEGRVRR FEFLAPTEWN FHPRGPVAGS LTGARLCGSA DRAAIEAMIS SFDPCVGYSL
AVREMADA