Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1152 |
Symbol | |
ID | 6408808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 1221959 |
End bp | 1222960 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 642711050 |
Product | NADH ubiquinone oxidoreductase 20 kDa subunit |
Protein accession | YP_001990167 |
Protein GI | 192289562 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTCCT TCGACATCCT TTGGCTGCAA GGAGCCAGCT GCGGCGGCTG CACCATGGCG GCGCTGGACG GCGGGCATTC CGGCTGGTTT GCCGAGCTGA AGCGGTTCGG CATCGATCTG CTGTGGCATC CTTCCGTGAG CGAGGCGACC GCCGAAGAGG CGGTGGCGAT CTTCGAACGC ATTGCGTCCG GCGAGCAGAA GCTCGGCGCG TTGGTGCTGG AAGGCGCCGT GCTGCGCGGG CCAAATGGCA GTGGCCGGTT CAATATGCTC GGCGGCACCG GGCGTTCGAT GCTGCATTGG GTGACGGCGC TGGCGCCGCG CGCCGATTAT GTGGTCGCGG CGGGAAGCTG CGCGGCGTTC GGCGGCGTGC CGATGGCCGG CGGCAATCCG ACCGACGCCA GCGGCCTGCA ATATGCGGCG GCAGAGCCAG GCGGCGTCCT CGGCACGGCC TTCCGCTCCC GCGCCGGGCT GCCGGTCATC AACATCGCCG GCTGCGCGCC GCATCCCGGC TGGATTTCCG AAACACTGGC CGCGCTGGCG CTTGGCGGTG TCGACACCGC CGCGCTCGAT GCGTTCGGCC GGCCGCGGTT CTTCGCCGAT CATCTGGCGC ATCACGGCTG CGCCCGCAAC GAGTACTACG AGTTCAAGGC CAGCGCCGAG GAGCTGTCGC AGCAGGGCTG CCTGATGGAG CATCTCGGCT GCAAGGCGAC CCAGGCGGTC GGTGACTGCA ATCAGCGCGG CTGGAACGGC AGCGGCTCCT GCACCAGCGG CGGCTATGCC TGCATCGCCT GCACCTCGCC GGGCTTCGAG TCCTCCCAGG GCTTCATGGA AACCGCCAAG CTCGCCGGCA TTCCGGTCGG TCTGCCGCTG GACATGCCGA AGGCGTGGTT CGTCGCGCTG GCGGCGCTGT CGAAATCGGC GACGCCGAAG CGGGTTCGCG CCAACGCGGC GGCGGATCAC ATCGTGGTGC CGCCGCGCAG CGATCCCGGC CGGCGGCCAT GA
|
Protein sequence | MDSFDILWLQ GASCGGCTMA ALDGGHSGWF AELKRFGIDL LWHPSVSEAT AEEAVAIFER IASGEQKLGA LVLEGAVLRG PNGSGRFNML GGTGRSMLHW VTALAPRADY VVAAGSCAAF GGVPMAGGNP TDASGLQYAA AEPGGVLGTA FRSRAGLPVI NIAGCAPHPG WISETLAALA LGGVDTAALD AFGRPRFFAD HLAHHGCARN EYYEFKASAE ELSQQGCLME HLGCKATQAV GDCNQRGWNG SGSCTSGGYA CIACTSPGFE SSQGFMETAK LAGIPVGLPL DMPKAWFVAL AALSKSATPK RVRANAAADH IVVPPRSDPG RRP
|
| |