Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3775 |
Symbol | |
ID | 3969462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4196728 |
End bp | 4197729 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637926885 |
Product | NADH ubiquinone oxidoreductase, 20 kDa subunit |
Protein accession | YP_533629 |
Protein GI | 90425259 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACCGA TCAACCTCTT ATGGCTGCAG GCCGCCGGCT GCGGCGGCTG CACCATGGCG ATCCTCGAGC AGGGCCGCGC CGGCTGGTTC GCCGAGCTCG CCTCGTTCGA CATCAACCTG CTGTGGCATC CGTCGGTCAG CGAGGCCACT GCCGACGACG TGATCGATCT GCTCGCCGCC GTGTCGGCCG CGCGCACGCC GTTGTCGGTG CTGGTGGTCG AGGGCGCGGT GCTGCGCGGA CCGAACGGCA GCGGCCGCTT CAACATGCTG GGCGGCACCG GCCGCTCGAT GGCGTCCTGG ATTTCCGAAT TGGCGCCGCG CGCCGACTAC GTGGTCGCAG TGGGAAGCTG TTCGGCCTAC GGCGGCGTGC CGGCGGCCGG GCACAATCCG ACCGATGCCT CCGGCTTGCA GTTTCTCGGC ACCGAACCGG GCGGCGTGCT CGGCGCGGCG TTCCGTTCCA AGGCGGGCCT GCCGGTGATC AACATCGCCG GCTGCGCGCC GCATCCCGGC TGGATCGCCG AGACGCTGGC CGCGCTGGCT TTGGGCGAAT TCTCCGCCGC GGCGCTGGAT AGTTTCGCAC GGCCAAAATT CTTCGCCGAG CATCTGGCGC ATCACGGCTG CGCCCGCAAC GAGTTCTACG AATTCAAGGC CAGCGCCGAG GCGATGTCGC AGCGCGGCTG TCTGATGGAG CATCTCGGCT GCAAGGCGAC GCAGGCGGTC GGCGATTGCA ACCAGCGCTC CTGGAACGGC GGCGGCTCCT GCACCCAGGC CGGCTATCCC TGCATCGCCT GCACCTCGCC GGGCTTCGAA GCCGCGCACA ACTACATGAC CACCGCGAAG GTCGCGGGCA TTCCCGTCGG CTTGCCGCTC GATATGCCAA AAGCCTGGTT CGTGGCGCTG GCGGCACTGT CGAAATCGGC GACGCCGAAG CGGGTGCGCG CCAACGCCAC CGCCGATCAC GTCGTGGTGC CGCCGACGCC GTCCGGACAT CGGCGCAAGT GA
|
Protein sequence | MGPINLLWLQ AAGCGGCTMA ILEQGRAGWF AELASFDINL LWHPSVSEAT ADDVIDLLAA VSAARTPLSV LVVEGAVLRG PNGSGRFNML GGTGRSMASW ISELAPRADY VVAVGSCSAY GGVPAAGHNP TDASGLQFLG TEPGGVLGAA FRSKAGLPVI NIAGCAPHPG WIAETLAALA LGEFSAAALD SFARPKFFAE HLAHHGCARN EFYEFKASAE AMSQRGCLME HLGCKATQAV GDCNQRSWNG GGSCTQAGYP CIACTSPGFE AAHNYMTTAK VAGIPVGLPL DMPKAWFVAL AALSKSATPK RVRANATADH VVVPPTPSGH RRK
|
| |