Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_0461 |
Symbol | hupL |
ID | 5154212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 460952 |
End bp | 462550 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640555479 |
Product | uptake hydrogenase large subunit |
Protein accession | YP_001236652 |
Protein GI | 148252067 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0412944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0758473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCAG CAGTTCAAAC GCTTGATATT TCACCCGTCG GACGCGTCGA GGGCGACCTC GACGTGCGCG TCGATATCCA GAACGGCGTC GTCGTTAATG CGTGGACCCA GGCCGAACTC TTCCGCGGCT TCGAGGTGAT CCTTCGCGAC AAGGATCCTC AAGCGGGACT CGTAGTGACG CCACGCGCGT GTGGCATCTG CGGCGCCTCG CATCTGACTT GCGCCGCCTG GGCGCTCGAC ACCGCGTGGA AGACCGAGGT TCCCCGCAAC GCCATCCTCG CGCGCAATCT CGGACAGATC GCAGAGAGCC TGCAGAGCCT TCCCAGGCAC CATTACGGCC TCTTCATGAT CGACTACACG CACAAGAACT ACTCGCGCTC CAAATATTAC GAGGAGGCGG TTCGGCGATG GTCACCATAC ACGGGTACCA ACTACGAGCT CGGCGTTACG ATTTCGGGCC GTCCCGTCGA AATTTATGCG CTTCTCGGTG GCCAGTGGCC GCATTCGAGT TTCATGGTCC CTGGCGGGGT GATGTGCGCG CCCACGCTGA CTGACGTCAC CCGCGCCTGG TCGATTCTGG AGCATTTCCG GCGCAACTGG ATGGAGCCGG TATGGCTTGG CTGCTCCTTC GAGCGCTACG AGGAAATCAA ATCCTACGAC GACTTCATGG CGTGGCTAGA CGAGCGGCCC GAGCATGCCA ACTCAGACCT CGGGATGTTC TGGCGCATGA GCCAGGACAT CGGCGTCGAC AAATACGGCA AGGGCCACGG AAAATACGTG TCCTGGGGAT ATCTGCCTCA TGAGGACAAG TACAACCGCC CGACGATCGA GGGCCGCAAT GCGGCGGTGA TCATGAAGAG CGGTGTTTAT GATGGCGCCA GTGACACCCA TAAGCTGATG GATCAGATTC ACACCCGCGA GGATCTGATG CATGCCTGGT ACGATGAACA GAGCGGCAAA CACCCCTTCG ACCGAGTCAC CAAGCCGGTT GGCAAAAACC CCGTCGATCA CACTAAGCAA TATTCATGGG CGACGGCCGT TCGCCACGAC CAGAACGGCA GGCTCGAAGC CGGTCCGCTG GCGCGCCAGC TCATTGCTGG CGGACCGCAC GGAGAACGCT GGCAGCATCA CGATCCGCTG GTGCTCGACA TGTATAGGAA ACTCGGAGGC GCAAGCGTCA TGCTGCGTCA TTTCGCCCGC ATGCATGAAG GCGTGAAGCT CTATCGGCAA GCCGAGCATG CCCTGCGCGA ATTCCGGCTG AACGATCCCT GGTATGTCAA ACCGACGGAA AAGGATGGGC GGGGCTGGGG CGCCACCGAG GCGATCCGCG GCGCCCTTTG TCACTGGATC GAGGTGCAGG GCGGGAAGAT CAAGAACTAC CAGATCATTA CGCCAACGAC CTGGAACGTC GGTCCGCGTT CCGACCGTGA TGAACTTGGC CCGATCGAGC AAGCGCTCAT CGGAACTCCG GTTGCCGACG TGAATGATCC TGTAGAGGTC GGGCATGTTT GTCGCTCATA TGACTCGTGC CTCGTCTGTA CTGTGCACGC CCATCACGCC AGCACGGGCA AGGAACTTGC ACGTTTCCGC ACGGCCTAG
|
Protein sequence | MSAAVQTLDI SPVGRVEGDL DVRVDIQNGV VVNAWTQAEL FRGFEVILRD KDPQAGLVVT PRACGICGAS HLTCAAWALD TAWKTEVPRN AILARNLGQI AESLQSLPRH HYGLFMIDYT HKNYSRSKYY EEAVRRWSPY TGTNYELGVT ISGRPVEIYA LLGGQWPHSS FMVPGGVMCA PTLTDVTRAW SILEHFRRNW MEPVWLGCSF ERYEEIKSYD DFMAWLDERP EHANSDLGMF WRMSQDIGVD KYGKGHGKYV SWGYLPHEDK YNRPTIEGRN AAVIMKSGVY DGASDTHKLM DQIHTREDLM HAWYDEQSGK HPFDRVTKPV GKNPVDHTKQ YSWATAVRHD QNGRLEAGPL ARQLIAGGPH GERWQHHDPL VLDMYRKLGG ASVMLRHFAR MHEGVKLYRQ AEHALREFRL NDPWYVKPTE KDGRGWGATE AIRGALCHWI EVQGGKIKNY QIITPTTWNV GPRSDRDELG PIEQALIGTP VADVNDPVEV GHVCRSYDSC LVCTVHAHHA STGKELARFR TA
|
| |