Gene BBta_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_1994 
SymbolhupU 
ID5152128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp2060592 
End bp2061608 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content68% 
IMG OID640556935 
Productuptake hydrogenase accessory 
Protein accessionYP_001238091 
Protein GI148253506 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGGCG CGGAGGGAAT GACCAGGATG TTGTGGCTGC AGGGCGCGAG CTGCGGCGGC 
TGCACCATGT CCATTCTGGA AAGCGGCGCC TCCGGCTGGT TCGACGAACT GCGCCAGTCC
GGCATCGATC TGGTGTGGCA TCCCTCCGTC AGCGAGGAGA CCGGCGAGGA AGCCGCGGAA
CTGCTCGAGG CCATCCGCGA CGGGCGCGAG CGGCTCGATC TGCTGGTGCT CGAAGGCTCG
GTCGCCCGGG GCCCTAACCT GAGCGGCCGC TTCAACATGC TGGCGGGCAC CAACCGCTCA
ATCTATCATT GGCTGCTCGA TCTCGCCCCG CTGGCCGACT ATGTCGTCGC GGTCGGCAGC
TGCGCGGCTT ATGGCGGCAT TCCCGCGGCC GGCATCAACC CGACCGATGC GGTCGGGTTG
CAGTTCGAGG GGAGCGACGT CGGCGGCGCG CTAGGGGCGG GTTTCCGCTC GAAGCGCGGG
TTGCCGGTGA TCAATGTCGC CGGCTGCGCG CCACACCCCG GCTGGATCAT GGAAAGCCTG
CTTGCGCTCA CGACTGGCGA TCTCACCGCC GACGGCCTCG ACGCCGTCGG GCGTCCCGCC
TTCATCGCCA ACCACCTCGC TCATCATGGC TGCTCGCGCA ACGAGTTCTA TGAGTTCAAG
GCGAGCGCGG AAGCCATGTC GGAGCGGGGC TGTTTGATGG AGCATCTCGG CTGCCGCGCG
ACGCAGGCTG TCGGCGACTG CAATCAGCGG TCCTGGAACG GTGGCGGCTC CTGCACCAAG
GGCGGCTATG CCTGCATCGC CTGCACCTCG CCTGGCTTCG AAAGCGCGCA GAACTATCTG
CAGACCGCGA AGCTCGCCGG CATTCCCGTC GGGCTTCCGA CCGACATGCC CAAGGCGTGG
TTCGTCGCGC TCGCGGCCCT GTCGAAATCG GCGACTCCGC GCCGCGTGCG CGTCAATGCC
ACGGCTGATC ACGTCGTGGT GCCGCCGAGC CGCTCGGGCG ACAAGCGCAG TTCATGA
 
Protein sequence
MVGAEGMTRM LWLQGASCGG CTMSILESGA SGWFDELRQS GIDLVWHPSV SEETGEEAAE 
LLEAIRDGRE RLDLLVLEGS VARGPNLSGR FNMLAGTNRS IYHWLLDLAP LADYVVAVGS
CAAYGGIPAA GINPTDAVGL QFEGSDVGGA LGAGFRSKRG LPVINVAGCA PHPGWIMESL
LALTTGDLTA DGLDAVGRPA FIANHLAHHG CSRNEFYEFK ASAEAMSERG CLMEHLGCRA
TQAVGDCNQR SWNGGGSCTK GGYACIACTS PGFESAQNYL QTAKLAGIPV GLPTDMPKAW
FVALAALSKS ATPRRVRVNA TADHVVVPPS RSGDKRSS