Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_1994 |
Symbol | hupU |
ID | 5152128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 2060592 |
End bp | 2061608 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640556935 |
Product | uptake hydrogenase accessory |
Protein accession | YP_001238091 |
Protein GI | 148253506 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGGCG CGGAGGGAAT GACCAGGATG TTGTGGCTGC AGGGCGCGAG CTGCGGCGGC TGCACCATGT CCATTCTGGA AAGCGGCGCC TCCGGCTGGT TCGACGAACT GCGCCAGTCC GGCATCGATC TGGTGTGGCA TCCCTCCGTC AGCGAGGAGA CCGGCGAGGA AGCCGCGGAA CTGCTCGAGG CCATCCGCGA CGGGCGCGAG CGGCTCGATC TGCTGGTGCT CGAAGGCTCG GTCGCCCGGG GCCCTAACCT GAGCGGCCGC TTCAACATGC TGGCGGGCAC CAACCGCTCA ATCTATCATT GGCTGCTCGA TCTCGCCCCG CTGGCCGACT ATGTCGTCGC GGTCGGCAGC TGCGCGGCTT ATGGCGGCAT TCCCGCGGCC GGCATCAACC CGACCGATGC GGTCGGGTTG CAGTTCGAGG GGAGCGACGT CGGCGGCGCG CTAGGGGCGG GTTTCCGCTC GAAGCGCGGG TTGCCGGTGA TCAATGTCGC CGGCTGCGCG CCACACCCCG GCTGGATCAT GGAAAGCCTG CTTGCGCTCA CGACTGGCGA TCTCACCGCC GACGGCCTCG ACGCCGTCGG GCGTCCCGCC TTCATCGCCA ACCACCTCGC TCATCATGGC TGCTCGCGCA ACGAGTTCTA TGAGTTCAAG GCGAGCGCGG AAGCCATGTC GGAGCGGGGC TGTTTGATGG AGCATCTCGG CTGCCGCGCG ACGCAGGCTG TCGGCGACTG CAATCAGCGG TCCTGGAACG GTGGCGGCTC CTGCACCAAG GGCGGCTATG CCTGCATCGC CTGCACCTCG CCTGGCTTCG AAAGCGCGCA GAACTATCTG CAGACCGCGA AGCTCGCCGG CATTCCCGTC GGGCTTCCGA CCGACATGCC CAAGGCGTGG TTCGTCGCGC TCGCGGCCCT GTCGAAATCG GCGACTCCGC GCCGCGTGCG CGTCAATGCC ACGGCTGATC ACGTCGTGGT GCCGCCGAGC CGCTCGGGCG ACAAGCGCAG TTCATGA
|
Protein sequence | MVGAEGMTRM LWLQGASCGG CTMSILESGA SGWFDELRQS GIDLVWHPSV SEETGEEAAE LLEAIRDGRE RLDLLVLEGS VARGPNLSGR FNMLAGTNRS IYHWLLDLAP LADYVVAVGS CAAYGGIPAA GINPTDAVGL QFEGSDVGGA LGAGFRSKRG LPVINVAGCA PHPGWIMESL LALTTGDLTA DGLDAVGRPA FIANHLAHHG CSRNEFYEFK ASAEAMSERG CLMEHLGCRA TQAVGDCNQR SWNGGGSCTK GGYACIACTS PGFESAQNYL QTAKLAGIPV GLPTDMPKAW FVALAALSKS ATPRRVRVNA TADHVVVPPS RSGDKRSS
|
| |