Gene BBta_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_1997 
SymbolhupL 
ID5152108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp2064383 
End bp2066173 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content63% 
IMG OID640556938 
Productuptake hydrogenase large subunit precursor 
Protein accessionYP_001238094 
Protein GI148253509 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATCC AGACTCCCAA CGGCTTCAAT CTCGACAATT CCGGCAAGCG CGTCGTCGTC 
GATCCGCTCA CGCGCATCGA GGGACATCTG CGCGTCGAGG TGAATGTCGA CTCCAACAAT
GTCATCCGCA ACGCAGTGTC GACCGGCACG ATGTGGCGCG GCATCGAGAC TATCCTGCGC
GGCCGCGATC CGCGCGACGC CTGGGCCTTC ACCGAGCGGA TCTGCGGCGT CTGCACCGGC
ACCCACGCGC TGACCTCGGT GCGCGCGGTC GAGAATGCAC TGGTGATCAT GATCCCCGAC
AACGCCAATT CGATCCGCAA CATCATGCAG CTCTGCCTGC AGGTGCACGA CCATCTCGTG
CACTTCTATC ACCTGCATGC GCTGGATTGG GTCGACGTCG TCTCGGCGCT CAAGGCCGAT
CCGAAGGCAA CCTCTGCGCT GGCGCAGTCG ATCTCGGACT GGCCGCTGTC GTCGCCGGGC
TATTTCAAGG ACCTGCAGAT CCGGCTGACG AAGTTCGTCG AGTCCGGCCA GCTCGGTCCG
TTCAAGAACG CCTATTGGGG CCACGCGGCC TACAAGCTGC CGCCCGAAGC GAACCTGATG
GCGGTCGCGC ATTATCTGGA GGCGCTCGAC TTCCAGAAGG AGATCGTCAA GATCCACACC
ATCTATGGCG GCAAGAATCC GCATCCGAAC TGGCTGGTCG GCGGCGTGCC CTGCGCCATC
AATGTCGACG GGACCGGTGC GGTCGGCGCC ATCAATATGG AGCGGCTGAA TCTCGTTTCC
TCCATCATCG ACCGCTCGAT CGAGTTCGTG CAGAAGGTCT ATCTGCCCGA CGTCGTCGCC
ATCGGCTCGT TCTACAAGGA CTGGCTCTAT GGCGGTGGCC TCTCGGGCAA GAGCGTGATG
TCCTATGGCG ACATTCCGGA GAACGCCAAC GACTATTCGG CCAAGAACCT CAAGCTGCCG
CGCGGCGTGA TCCTCAACGG CAATCTCAAC GAGATCCTGC CGATCGATCA CGGCGACCCC
GAGCAGATCC AGGAGTTCGT CACCCACTCC TGGTACAAAT ATCCCGACGA GAGCAAGGGG
CTGCATCCCT GGGACGGCGT CACCGAGCCG AACTATCAGC TCGGCCCCAA TGCCAAGGGC
ACCAAGACCG ACATCAAGGA GCTCGACGAG GGCGGCAAGT ACTCCTGGAT CAAGGCGCCG
CGCTGGCGCG GCAACGCGGT CGAGGTCGGC CCCCTGGCGC GCTACATCAT CGGCTATGCG
CAGAACAGGC CGGAGTTCAA GGAGCCGACC GACAAGCTCC TGAAGGCGCT GAACCTGCCG
GTGACGGCGC TGTTCTCGAC GCTCGGTCGC ACCGCCGCGC GTGCGCTCGA ATGCGACTGG
GCCGCGACCC AGATGCGCTA CTTCCAGGAC AAGCTGGTGG CGCGCATCAA GGCCGGCGAT
TCCTCGACCG CGAACATCGA GAAGTGGAAG CCGGAGAGCT GGCCCAAGGA GGCCAAGGGC
TATGGCTTCA CCGAGGCGCC GCGCGGCGCG CTGGCGCACT GGATCAAGAT CAAGGAGACC
AGGATCGACA ACTACCAGTG CGTTGTGCCG ACCACCTGGA ACGGCTCGCC GCGTGACCCC
AAGGGCAATA TCGGCGCCTT CGAGGCGTCG CTGATGGATA CGCCGATGGC GGATCCGGAG
AAGCCGCTGG AGATCCTGCG GACGATTCAT TCGTTCGATC CGTGCCTTGC GTGCTCCACC
CACGTGATGA GCCCGGACGG CCAGGAAATG GCGACCGTCA AGGTCAGGTA G
 
Protein sequence
MGIQTPNGFN LDNSGKRVVV DPLTRIEGHL RVEVNVDSNN VIRNAVSTGT MWRGIETILR 
GRDPRDAWAF TERICGVCTG THALTSVRAV ENALVIMIPD NANSIRNIMQ LCLQVHDHLV
HFYHLHALDW VDVVSALKAD PKATSALAQS ISDWPLSSPG YFKDLQIRLT KFVESGQLGP
FKNAYWGHAA YKLPPEANLM AVAHYLEALD FQKEIVKIHT IYGGKNPHPN WLVGGVPCAI
NVDGTGAVGA INMERLNLVS SIIDRSIEFV QKVYLPDVVA IGSFYKDWLY GGGLSGKSVM
SYGDIPENAN DYSAKNLKLP RGVILNGNLN EILPIDHGDP EQIQEFVTHS WYKYPDESKG
LHPWDGVTEP NYQLGPNAKG TKTDIKELDE GGKYSWIKAP RWRGNAVEVG PLARYIIGYA
QNRPEFKEPT DKLLKALNLP VTALFSTLGR TAARALECDW AATQMRYFQD KLVARIKAGD
SSTANIEKWK PESWPKEAKG YGFTEAPRGA LAHWIKIKET RIDNYQCVVP TTWNGSPRDP
KGNIGAFEAS LMDTPMADPE KPLEILRTIH SFDPCLACST HVMSPDGQEM ATVKVR