Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_1997 |
Symbol | hupL |
ID | 5152108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 2064383 |
End bp | 2066173 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640556938 |
Product | uptake hydrogenase large subunit precursor |
Protein accession | YP_001238094 |
Protein GI | 148253509 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATCC AGACTCCCAA CGGCTTCAAT CTCGACAATT CCGGCAAGCG CGTCGTCGTC GATCCGCTCA CGCGCATCGA GGGACATCTG CGCGTCGAGG TGAATGTCGA CTCCAACAAT GTCATCCGCA ACGCAGTGTC GACCGGCACG ATGTGGCGCG GCATCGAGAC TATCCTGCGC GGCCGCGATC CGCGCGACGC CTGGGCCTTC ACCGAGCGGA TCTGCGGCGT CTGCACCGGC ACCCACGCGC TGACCTCGGT GCGCGCGGTC GAGAATGCAC TGGTGATCAT GATCCCCGAC AACGCCAATT CGATCCGCAA CATCATGCAG CTCTGCCTGC AGGTGCACGA CCATCTCGTG CACTTCTATC ACCTGCATGC GCTGGATTGG GTCGACGTCG TCTCGGCGCT CAAGGCCGAT CCGAAGGCAA CCTCTGCGCT GGCGCAGTCG ATCTCGGACT GGCCGCTGTC GTCGCCGGGC TATTTCAAGG ACCTGCAGAT CCGGCTGACG AAGTTCGTCG AGTCCGGCCA GCTCGGTCCG TTCAAGAACG CCTATTGGGG CCACGCGGCC TACAAGCTGC CGCCCGAAGC GAACCTGATG GCGGTCGCGC ATTATCTGGA GGCGCTCGAC TTCCAGAAGG AGATCGTCAA GATCCACACC ATCTATGGCG GCAAGAATCC GCATCCGAAC TGGCTGGTCG GCGGCGTGCC CTGCGCCATC AATGTCGACG GGACCGGTGC GGTCGGCGCC ATCAATATGG AGCGGCTGAA TCTCGTTTCC TCCATCATCG ACCGCTCGAT CGAGTTCGTG CAGAAGGTCT ATCTGCCCGA CGTCGTCGCC ATCGGCTCGT TCTACAAGGA CTGGCTCTAT GGCGGTGGCC TCTCGGGCAA GAGCGTGATG TCCTATGGCG ACATTCCGGA GAACGCCAAC GACTATTCGG CCAAGAACCT CAAGCTGCCG CGCGGCGTGA TCCTCAACGG CAATCTCAAC GAGATCCTGC CGATCGATCA CGGCGACCCC GAGCAGATCC AGGAGTTCGT CACCCACTCC TGGTACAAAT ATCCCGACGA GAGCAAGGGG CTGCATCCCT GGGACGGCGT CACCGAGCCG AACTATCAGC TCGGCCCCAA TGCCAAGGGC ACCAAGACCG ACATCAAGGA GCTCGACGAG GGCGGCAAGT ACTCCTGGAT CAAGGCGCCG CGCTGGCGCG GCAACGCGGT CGAGGTCGGC CCCCTGGCGC GCTACATCAT CGGCTATGCG CAGAACAGGC CGGAGTTCAA GGAGCCGACC GACAAGCTCC TGAAGGCGCT GAACCTGCCG GTGACGGCGC TGTTCTCGAC GCTCGGTCGC ACCGCCGCGC GTGCGCTCGA ATGCGACTGG GCCGCGACCC AGATGCGCTA CTTCCAGGAC AAGCTGGTGG CGCGCATCAA GGCCGGCGAT TCCTCGACCG CGAACATCGA GAAGTGGAAG CCGGAGAGCT GGCCCAAGGA GGCCAAGGGC TATGGCTTCA CCGAGGCGCC GCGCGGCGCG CTGGCGCACT GGATCAAGAT CAAGGAGACC AGGATCGACA ACTACCAGTG CGTTGTGCCG ACCACCTGGA ACGGCTCGCC GCGTGACCCC AAGGGCAATA TCGGCGCCTT CGAGGCGTCG CTGATGGATA CGCCGATGGC GGATCCGGAG AAGCCGCTGG AGATCCTGCG GACGATTCAT TCGTTCGATC CGTGCCTTGC GTGCTCCACC CACGTGATGA GCCCGGACGG CCAGGAAATG GCGACCGTCA AGGTCAGGTA G
|
Protein sequence | MGIQTPNGFN LDNSGKRVVV DPLTRIEGHL RVEVNVDSNN VIRNAVSTGT MWRGIETILR GRDPRDAWAF TERICGVCTG THALTSVRAV ENALVIMIPD NANSIRNIMQ LCLQVHDHLV HFYHLHALDW VDVVSALKAD PKATSALAQS ISDWPLSSPG YFKDLQIRLT KFVESGQLGP FKNAYWGHAA YKLPPEANLM AVAHYLEALD FQKEIVKIHT IYGGKNPHPN WLVGGVPCAI NVDGTGAVGA INMERLNLVS SIIDRSIEFV QKVYLPDVVA IGSFYKDWLY GGGLSGKSVM SYGDIPENAN DYSAKNLKLP RGVILNGNLN EILPIDHGDP EQIQEFVTHS WYKYPDESKG LHPWDGVTEP NYQLGPNAKG TKTDIKELDE GGKYSWIKAP RWRGNAVEVG PLARYIIGYA QNRPEFKEPT DKLLKALNLP VTALFSTLGR TAARALECDW AATQMRYFQD KLVARIKAGD SSTANIEKWK PESWPKEAKG YGFTEAPRGA LAHWIKIKET RIDNYQCVVP TTWNGSPRDP KGNIGAFEAS LMDTPMADPE KPLEILRTIH SFDPCLACST HVMSPDGQEM ATVKVR
|
| |