Gene RPC_3775 details


Gene Information

Locus tag: RPC_3775
Symbol:
ID: 3969462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4196728
End bp: 4197729
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 69%
IMG OID: 637926885
Product: NADH ubiquinone oxidoreductase, 20 kDa subunit
Protein accession: YP_533629
Protein GI: 90425259
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
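
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are inclusive at both ends and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 4196728
END_BP = 4197729


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in exactly one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length fields.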


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACCGA TCAACCTCTT ATGGCTGCAG GCCGCCGGCT GCGGCGGCTG CACCATGGCG 
ATCCTCGAGC AGGGCCGCGC CGGCTGGTTC GCCGAGCTCG CCTCGTTCGA CATCAACCTG
CTGTGGCATC CGTCGGTCAG CGAGGCCACT GCCGACGACG TGATCGATCT GCTCGCCGCC
GTGTCGGCCG CGCGCACGCC GTTGTCGGTG CTGGTGGTCG AGGGCGCGGT GCTGCGCGGA
CCGAACGGCA GCGGCCGCTT CAACATGCTG GGCGGCACCG GCCGCTCGAT GGCGTCCTGG
ATTTCCGAAT TGGCGCCGCG CGCCGACTAC GTGGTCGCAG TGGGAAGCTG TTCGGCCTAC
GGCGGCGTGC CGGCGGCCGG GCACAATCCG ACCGATGCCT CCGGCTTGCA GTTTCTCGGC
ACCGAACCGG GCGGCGTGCT CGGCGCGGCG TTCCGTTCCA AGGCGGGCCT GCCGGTGATC
AACATCGCCG GCTGCGCGCC GCATCCCGGC TGGATCGCCG AGACGCTGGC CGCGCTGGCT
TTGGGCGAAT TCTCCGCCGC GGCGCTGGAT AGTTTCGCAC GGCCAAAATT CTTCGCCGAG
CATCTGGCGC ATCACGGCTG CGCCCGCAAC GAGTTCTACG AATTCAAGGC CAGCGCCGAG
GCGATGTCGC AGCGCGGCTG TCTGATGGAG CATCTCGGCT GCAAGGCGAC GCAGGCGGTC
GGCGATTGCA ACCAGCGCTC CTGGAACGGC GGCGGCTCCT GCACCCAGGC CGGCTATCCC
TGCATCGCCT GCACCTCGCC GGGCTTCGAA GCCGCGCACA ACTACATGAC CACCGCGAAG
GTCGCGGGCA TTCCCGTCGG CTTGCCGCTC GATATGCCAA AAGCCTGGTT CGTGGCGCTG
GCGGCACTGT CGAAATCGGC GACGCCGAAG CGGGTGCGCG CCAACGCCAC CGCCGATCAC
GTCGTGGTGC CGCCGACGCC GTCCGGACAT CGGCGCAAGT GA
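
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and the demonstration uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used here only as a short demonstration.
first_blocks = "ATGGGACCGA TCAACCTCTT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_percent(seq))
```

Pasting the complete gene sequence into `clean_sequence` allows the same functions to be compared against the Gene Length and GC content fields of the record.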
 
Protein sequence
MGPINLLWLQ AAGCGGCTMA ILEQGRAGWF AELASFDINL LWHPSVSEAT ADDVIDLLAA 
VSAARTPLSV LVVEGAVLRG PNGSGRFNML GGTGRSMASW ISELAPRADY VVAVGSCSAY
GGVPAAGHNP TDASGLQFLG TEPGGVLGAA FRSKAGLPVI NIAGCAPHPG WIAETLAALA
LGEFSAAALD SFARPKFFAE HLAHHGCARN EFYEFKASAE AMSQRGCLME HLGCKATQAV
GDCNQRSWNG GGSCTQAGYP CIACTSPGFE AAHNYMTTAK VAGIPVGLPL DMPKAWFVAL
AALSKSATPK RVRANATADH VVVPPTPSGH RRK
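
The protein sequence corresponds to a translation of the gene sequence with translation table 11. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons it permits, so a sketch with the standard assignments is enough for a gene that starts with ATG. The code below is an illustration under that assumption, not the database's own pipeline; `translate` is a hypothetical helper and it does not handle alternative start codons or ambiguous bases.

```python
# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at the first stop codon.

    Whitespace is ignored, trailing bases that do not fill a codon are dropped,
    and any codon containing a character other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two blocks and last two blocks of the gene sequence shown above.
print(translate("ATGGGACCGA TCAACCTCTT"))
print(translate("CGGCGCAAGT GA"))
```

The two fragments correspond to the beginning and end of the listed protein sequence; passing the full gene sequence to `translate` lets the whole protein be compared residue by residue.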