Gene Information |
Locus tag | RPD_1160 |
Symbol | |
ID | 4021636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1319752 |
End bp | 1320753 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637961352 |
Product | NADH ubiquinone oxidoreductase, 20 kDa subunit |
Protein accession | YP_568299 |
Protein GI | 91975640 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) |
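The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and is not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the CDS ends with exactly one stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed for RPD_1160
gene_len, protein_len = cds_lengths(1319752, 1320753)
print(gene_len, protein_len)  # 1002 333
```

The result matches the listed Gene Length (1002 bp) and Protein Length (333 aa).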
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCCGT TCAATGTGCT GTGGCTGCAG GGCGCGAGCT GCGGCGGCTG CACCATGGCG GCGCTCGACA ACGGCCACTC CGGCTGGTTC GCCGACCTCG CCCGTTTCGG CATCGATCTG ATCTGGCATC CCTCGCTGAG CGAGGCGACC GCAGACGAAG CGGTCGCGAT CTTCGAGCGC GTGGCGGACG GCGCGCAGCG GCTCGACGCG CTGGTGCTGG AAGGCGCGGT GCTGCGCGGG CCGAACGGCA CCGGCCGCTT CAACATGCTC GGCGGCACCC ATCGCGCGAT GCTGCATTGG GTCCGCGCGC TCGCGCCCTG CGCGAACTAC GTCGTCGCCG CCGGGAGTTG CGCTGCATTC GGCGGCGTGC CGATGGCCGG CAGCAATCCG ACCGACGCCA GCGGCCTGCA GTTTGCGGGC GTGGAAGCAG GCGGCGCGCT CGGCGCGGCG TTTCGATCCC GCGCCGGCCT GCCGGTGATC AACATCGCCG GCTGCGCGCC GCATCCGGGC TGGATTGCCG AAACACTGGC GGCGCTGGCG CTCGGCGGTT TCGACAGCGA AGCGCTCGAC AGCTTCGGCC GGCCGCGATT CTACGCCGAT CACCTCGCGC ATCACGGCTG CGCCCGCAAC GAGTATTACG AGTTCAAGGC CAGCGCCGAG ACGCTGTCGC AGCAAGGCTG CCTGATGGAG CATCTCGGCT GCAAGGCGAC CCAGGCGGTC GGCGACTGCA ACCAGCGCGG CTGGAACGGC GGCGGCTCCT GCACCAGCGG CGGCGGCGCC TGCATCGCGT GTACGTCGCC CGGCTTCGAG GCATCGCAGA ACTTCATGGA GACCGCCAAG CTCGGCGGCA TTCCGGTCGG CCTGCCGCTC GACATGCCGA AGGCGTGGTT CGTGGCGCTG GCCGCGTTGT CGAAATCGGC GACGCCGAAA CGGGTGCGCG CCAATGCGAC CGCCGACCAT GTGATCGTGC CGCCGCGCAC TGATCACGGA CGCCGCAAAT GA
|
Protein sequence | MEPFNVLWLQ GASCGGCTMA ALDNGHSGWF ADLARFGIDL IWHPSLSEAT ADEAVAIFER VADGAQRLDA LVLEGAVLRG PNGTGRFNML GGTHRAMLHW VRALAPCANY VVAAGSCAAF GGVPMAGSNP TDASGLQFAG VEAGGALGAA FRSRAGLPVI NIAGCAPHPG WIAETLAALA LGGFDSEALD SFGRPRFYAD HLAHHGCARN EYYEFKASAE TLSQQGCLME HLGCKATQAV GDCNQRGWNG GGSCTSGGGA CIACTSPGFE ASQNFMETAK LGGIPVGLPL DMPKAWFVAL AALSKSATPK RVRANATADH VIVPPRTDHG RRK
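The gene and protein sequences above can be cross-checked with a minimal translation sketch. It assumes that the sequence is given 5' to 3' on the coding strand, that whitespace between 10-base blocks is only formatting, and that translation table 11 uses the standard codon assignments for elongation (it differs from the standard table in which start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical; the sketch does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which table 11 shares for elongation).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above
print(translate("ATGGAGCCGT TCAATGTGCT GTGGCTGCAG"))  # MEPFNVLWLQ
# Last twelve bases of the gene sequence, ending in the TGA stop codon
print(translate("CGCCGCAAAT GA"))  # RRK
```

Passing the full Gene sequence field to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields.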