Gene BBta_0461 details

Gene Information

Locus tag: BBta_0461
Symbol: hupL
ID: 5154212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 460952
End bp: 462550
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 60%
IMG OID: 640555479
Product: uptake hydrogenase large subunit
Protein accession: YP_001236652
Protein GI: 148252067
COG category: [C] Energy production and conversion
COG ID: [COG0374] Ni,Fe-hydrogenase I large subunit
TIGRFAM ID:
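
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length_bp(460952, 462550)
print(length)                     # 1599
print(protein_length_aa(length))  # 532
```

Under these assumptions, 1599 bp corresponds to 533 codons, one of which is the stop codon, giving the reported 532 aa.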


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0412944
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0758473
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCAG CAGTTCAAAC GCTTGATATT TCACCCGTCG GACGCGTCGA GGGCGACCTC 
GACGTGCGCG TCGATATCCA GAACGGCGTC GTCGTTAATG CGTGGACCCA GGCCGAACTC
TTCCGCGGCT TCGAGGTGAT CCTTCGCGAC AAGGATCCTC AAGCGGGACT CGTAGTGACG
CCACGCGCGT GTGGCATCTG CGGCGCCTCG CATCTGACTT GCGCCGCCTG GGCGCTCGAC
ACCGCGTGGA AGACCGAGGT TCCCCGCAAC GCCATCCTCG CGCGCAATCT CGGACAGATC
GCAGAGAGCC TGCAGAGCCT TCCCAGGCAC CATTACGGCC TCTTCATGAT CGACTACACG
CACAAGAACT ACTCGCGCTC CAAATATTAC GAGGAGGCGG TTCGGCGATG GTCACCATAC
ACGGGTACCA ACTACGAGCT CGGCGTTACG ATTTCGGGCC GTCCCGTCGA AATTTATGCG
CTTCTCGGTG GCCAGTGGCC GCATTCGAGT TTCATGGTCC CTGGCGGGGT GATGTGCGCG
CCCACGCTGA CTGACGTCAC CCGCGCCTGG TCGATTCTGG AGCATTTCCG GCGCAACTGG
ATGGAGCCGG TATGGCTTGG CTGCTCCTTC GAGCGCTACG AGGAAATCAA ATCCTACGAC
GACTTCATGG CGTGGCTAGA CGAGCGGCCC GAGCATGCCA ACTCAGACCT CGGGATGTTC
TGGCGCATGA GCCAGGACAT CGGCGTCGAC AAATACGGCA AGGGCCACGG AAAATACGTG
TCCTGGGGAT ATCTGCCTCA TGAGGACAAG TACAACCGCC CGACGATCGA GGGCCGCAAT
GCGGCGGTGA TCATGAAGAG CGGTGTTTAT GATGGCGCCA GTGACACCCA TAAGCTGATG
GATCAGATTC ACACCCGCGA GGATCTGATG CATGCCTGGT ACGATGAACA GAGCGGCAAA
CACCCCTTCG ACCGAGTCAC CAAGCCGGTT GGCAAAAACC CCGTCGATCA CACTAAGCAA
TATTCATGGG CGACGGCCGT TCGCCACGAC CAGAACGGCA GGCTCGAAGC CGGTCCGCTG
GCGCGCCAGC TCATTGCTGG CGGACCGCAC GGAGAACGCT GGCAGCATCA CGATCCGCTG
GTGCTCGACA TGTATAGGAA ACTCGGAGGC GCAAGCGTCA TGCTGCGTCA TTTCGCCCGC
ATGCATGAAG GCGTGAAGCT CTATCGGCAA GCCGAGCATG CCCTGCGCGA ATTCCGGCTG
AACGATCCCT GGTATGTCAA ACCGACGGAA AAGGATGGGC GGGGCTGGGG CGCCACCGAG
GCGATCCGCG GCGCCCTTTG TCACTGGATC GAGGTGCAGG GCGGGAAGAT CAAGAACTAC
CAGATCATTA CGCCAACGAC CTGGAACGTC GGTCCGCGTT CCGACCGTGA TGAACTTGGC
CCGATCGAGC AAGCGCTCAT CGGAACTCCG GTTGCCGACG TGAATGATCC TGTAGAGGTC
GGGCATGTTT GTCGCTCATA TGACTCGTGC CTCGTCTGTA CTGTGCACGC CCATCACGCC
AGCACGGGCA AGGAACTTGC ACGTTTCCGC ACGGCCTAG
 
Protein sequence
MSAAVQTLDI SPVGRVEGDL DVRVDIQNGV VVNAWTQAEL FRGFEVILRD KDPQAGLVVT 
PRACGICGAS HLTCAAWALD TAWKTEVPRN AILARNLGQI AESLQSLPRH HYGLFMIDYT
HKNYSRSKYY EEAVRRWSPY TGTNYELGVT ISGRPVEIYA LLGGQWPHSS FMVPGGVMCA
PTLTDVTRAW SILEHFRRNW MEPVWLGCSF ERYEEIKSYD DFMAWLDERP EHANSDLGMF
WRMSQDIGVD KYGKGHGKYV SWGYLPHEDK YNRPTIEGRN AAVIMKSGVY DGASDTHKLM
DQIHTREDLM HAWYDEQSGK HPFDRVTKPV GKNPVDHTKQ YSWATAVRHD QNGRLEAGPL
ARQLIAGGPH GERWQHHDPL VLDMYRKLGG ASVMLRHFAR MHEGVKLYRQ AEHALREFRL
NDPWYVKPTE KDGRGWGATE AIRGALCHWI EVQGGKIKNY QIITPTTWNV GPRSDRDELG
PIEQALIGTP VADVNDPVEV GHVCRSYDSC LVCTVHAHHA STGKELARFR TA
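
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC content. The following is a minimal sketch, not the method used by the database: the function names are hypothetical, the codon table is built from the standard amino acid assignment string (which translation table 11 shares with the standard code for internal codons), and the special handling of alternative start codons in table 11 is ignored, which is harmless here because the gene starts with ATG. Only the first and last blocks of the gene sequence are used as examples.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 60 bases of the gene sequence, as displayed in the record.
first_block = "ATGTCCGCAG CAGTTCAAAC GCTTGATATT TCACCCGTCG GACGCGTCGA GGGCGACCTC"
print(translate(first_block))  # MSAAVQTLDISPVGRVEGDL

# Last 39 bases of the gene sequence, ending in the TAG stop codon.
last_block = "AGCACGGGCA AGGAACTTGC ACGTTTCCGC ACGGCCTAG"
print(translate(last_block))   # STGKELARFRTA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the result to be compared with the protein sequence and the GC content reported in the record.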