Gene Rpal_1152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1152 
Symbol 
ID6408808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1221959 
End bp1222960 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content70% 
IMG OID642711050 
ProductNADH ubiquinone oxidoreductase 20 kDa subunit 
Protein accessionYP_001990167 
Protein GI192289562 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTCCT TCGACATCCT TTGGCTGCAA GGAGCCAGCT GCGGCGGCTG CACCATGGCG 
GCGCTGGACG GCGGGCATTC CGGCTGGTTT GCCGAGCTGA AGCGGTTCGG CATCGATCTG
CTGTGGCATC CTTCCGTGAG CGAGGCGACC GCCGAAGAGG CGGTGGCGAT CTTCGAACGC
ATTGCGTCCG GCGAGCAGAA GCTCGGCGCG TTGGTGCTGG AAGGCGCCGT GCTGCGCGGG
CCAAATGGCA GTGGCCGGTT CAATATGCTC GGCGGCACCG GGCGTTCGAT GCTGCATTGG
GTGACGGCGC TGGCGCCGCG CGCCGATTAT GTGGTCGCGG CGGGAAGCTG CGCGGCGTTC
GGCGGCGTGC CGATGGCCGG CGGCAATCCG ACCGACGCCA GCGGCCTGCA ATATGCGGCG
GCAGAGCCAG GCGGCGTCCT CGGCACGGCC TTCCGCTCCC GCGCCGGGCT GCCGGTCATC
AACATCGCCG GCTGCGCGCC GCATCCCGGC TGGATTTCCG AAACACTGGC CGCGCTGGCG
CTTGGCGGTG TCGACACCGC CGCGCTCGAT GCGTTCGGCC GGCCGCGGTT CTTCGCCGAT
CATCTGGCGC ATCACGGCTG CGCCCGCAAC GAGTACTACG AGTTCAAGGC CAGCGCCGAG
GAGCTGTCGC AGCAGGGCTG CCTGATGGAG CATCTCGGCT GCAAGGCGAC CCAGGCGGTC
GGTGACTGCA ATCAGCGCGG CTGGAACGGC AGCGGCTCCT GCACCAGCGG CGGCTATGCC
TGCATCGCCT GCACCTCGCC GGGCTTCGAG TCCTCCCAGG GCTTCATGGA AACCGCCAAG
CTCGCCGGCA TTCCGGTCGG TCTGCCGCTG GACATGCCGA AGGCGTGGTT CGTCGCGCTG
GCGGCGCTGT CGAAATCGGC GACGCCGAAG CGGGTTCGCG CCAACGCGGC GGCGGATCAC
ATCGTGGTGC CGCCGCGCAG CGATCCCGGC CGGCGGCCAT GA
 
Protein sequence
MDSFDILWLQ GASCGGCTMA ALDGGHSGWF AELKRFGIDL LWHPSVSEAT AEEAVAIFER 
IASGEQKLGA LVLEGAVLRG PNGSGRFNML GGTGRSMLHW VTALAPRADY VVAAGSCAAF
GGVPMAGGNP TDASGLQYAA AEPGGVLGTA FRSRAGLPVI NIAGCAPHPG WISETLAALA
LGGVDTAALD AFGRPRFFAD HLAHHGCARN EYYEFKASAE ELSQQGCLME HLGCKATQAV
GDCNQRGWNG SGSCTSGGYA CIACTSPGFE SSQGFMETAK LAGIPVGLPL DMPKAWFVAL
AALSKSATPK RVRANAAADH IVVPPRSDPG RRP