Gene BBta_0475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_0475 
SymbolhypE 
ID5152663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp472347 
End bp473417 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content65% 
IMG OID640555493 
Producthydrogenase maturation 
Protein accessionYP_001236666 
Protein GI148252081 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.254195 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTTC TGGACCTGCC CCCGCGGCGC TCGCTCGGCC GAGTCCATGT GCCAGCCGTC 
ACACTGGCGC ATGGCGGCGG CGGCAAGGCC ATGAAGGATC TGATCGACGA CGTCTTCGTC
AGCGCCTTCT GCAATGCCAA GGCGCCGGAT GTACTGGAGG ATCAGGCGCG GCTCGACCTC
GCGGCGCTTG CCCGCTATGG CGACCGGCTC GCCTTCACCA CGGACTCCTT CGTCGTCGAT
CCGCTGTTCT TCCCCGGGGG CGATATCGGC AAGCTCGCGG TCTGCGGCAC GATCAACGAT
CTGGCCGTCG GCGGCGCCAA GCCGCTTTAT CTGTCCTGCG CCGTCGTCAT CGAGGAAGGA
ATGCCGCTCG ATGCTTTGCG CGGGATTGCG AATTCCATGG CTGAAGCGGC GAGAATGGCT
GGTGTGCGGA TCGTGACCGG CGATACCAAA GTCGTCCAGC GGGGCGCCTG CGACAAGCTC
TTTATCACAA CGACCGGCAT CGGCGTGATC CCGCCCCAGA TTGACCTCGG CATTCACCAG
ATCAAACCCG GAGATGGCAT ACTGGTGAAC GGACTGCTCG GCGACCACGG CGCAGCGATC
CTCGCGGCCC GAGGCGATCT TGCGCTGGAG ACTGAAGTCG CCAGCGACTG CGCCGCCCTA
CACGGGCTGA TCGAGGCCCT TCTACGAGCG GCGCCTGGAA CACGCTGCAT CCGCGACGCC
ACCCGCGGCG GCCTCGCCAC GGTGCTCAAT GAGATGGCGG AGGCCTCCGC GCTGTCGATC
GAAATCGACG AGTCGGCGAC GCCGCTGCGT GAAGAGGTGC GCGGCTTCTG CGAGATTCTC
GGGCTTGATC CTCTCTATCT CGCCAACGAG GGCAAGGTGG TCATCGCTGT GCCGCCTGCC
GAGATCGAAG CCGCGCTTGC GGCAATGCGC GCCCATCCGC TTGGCGCGGG AGCAGCCCTG
ATCGGCCATG CCAGCGGGGG AATTCCAGCG CGCGTCACCA TGCAGACTGT CTTCGGCGGG
AAGCGCATCG TCGATATGCT GATTGGTGAA CAGCTTCCAC GCATCTGTTG A
 
Protein sequence
MNLLDLPPRR SLGRVHVPAV TLAHGGGGKA MKDLIDDVFV SAFCNAKAPD VLEDQARLDL 
AALARYGDRL AFTTDSFVVD PLFFPGGDIG KLAVCGTIND LAVGGAKPLY LSCAVVIEEG
MPLDALRGIA NSMAEAARMA GVRIVTGDTK VVQRGACDKL FITTTGIGVI PPQIDLGIHQ
IKPGDGILVN GLLGDHGAAI LAARGDLALE TEVASDCAAL HGLIEALLRA APGTRCIRDA
TRGGLATVLN EMAEASALSI EIDESATPLR EEVRGFCEIL GLDPLYLANE GKVVIAVPPA
EIEAALAAMR AHPLGAGAAL IGHASGGIPA RVTMQTVFGG KRIVDMLIGE QLPRIC