Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2143 |
Symbol | |
ID | 4896234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2271387 |
End bp | 2272379 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640112737 |
Product | NADH ubiquinone oxidoreductase, 20 kDa subunit |
Protein accession | YP_001044018 |
Protein GI | 126462904 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0412213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0704049 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTC TCTGGCTTCA GGCCTCGGGC TGCGGCGGCT GCACCATGTC GCTCCTCTGC GCCGAGGCGC CGGGCCTCTT CGACCTTCTC GAGGATGCAG GCCTGCGCTT CCTGTGGCAC CCCTCGCTTT CGGTCGAGTC CGGGGCGGAG GTGCGGGCGC TCCTCGACCG GATAGAGGCG GGAGCGCAGC CGCTCGACAT CCTCTGCGTC GAGGGCGCCA TCGCGCGCGG GCCCCGCGGC ACCGGACGGT TCCAGATGCT CGCGGGCACC GGGCGCTCGA TGCTCGAGAC CGTCACGCGG CTCGCGCCGC TCGCCCGGCA TGTGGTGGCG GTGGGCAGCT GCGCGGCCTA TGGCGGGATG ACGAGCGCGG GCGGCAACCC CTCGGACGCG ACGGGGCTCC AGTACGAGGG CACCCACGAG GGCGGCATCC TTCCGCCCGA GTTCCGGGCG CGGGACGGAC TGCCCGTGGT GAATGTGGCC GGTTGCCCGA CCCATCCGGG CTGGGTGACC GAGACGCTGA TGCTGCTGGG CGGGAACGCG CTGGCGGCCG GCGATCTCGA CCGGTTCGGC CGCCCGCGCT TCTATGCCGA TCATCTCGTG CATCACGGCT GCTCGCGCAA CGAATATTAC GAATACAAGG CCAGCGCCCG CGCCCCCGGC GAGATCGGCT GCATGATGGA ACATATGGGC TGCATCGGCA CGCAGGCGGT GGGCGACTGC AACATCCGGC CCTGGAACGG CAGCGGCTCC TGCACCTCGG GCGGCTATGC CTGCATCGCC TGCACCGCGC CCGAATTCGA GGAGCCGCGC CACCCCTACT CCGAGACGCC CAAGATCGGC GGCATCCCGG TGGGCCTGCC CTCGGACATG CCGAAGGCCT GGTTCATGGC GCTGGCGAGC CTGTCCAAGG CCGCCACGCC CGAGCGCATA CGGCGGAACG CGGCCTCCGA CCGGATCGAG GTGCCCCCCA CGCTCCGGAT GCCGAAGCGA TGA
|
Protein sequence | MNILWLQASG CGGCTMSLLC AEAPGLFDLL EDAGLRFLWH PSLSVESGAE VRALLDRIEA GAQPLDILCV EGAIARGPRG TGRFQMLAGT GRSMLETVTR LAPLARHVVA VGSCAAYGGM TSAGGNPSDA TGLQYEGTHE GGILPPEFRA RDGLPVVNVA GCPTHPGWVT ETLMLLGGNA LAAGDLDRFG RPRFYADHLV HHGCSRNEYY EYKASARAPG EIGCMMEHMG CIGTQAVGDC NIRPWNGSGS CTSGGYACIA CTAPEFEEPR HPYSETPKIG GIPVGLPSDM PKAWFMALAS LSKAATPERI RRNAASDRIE VPPTLRMPKR
|
| |