Gene RPC_4556 details


Gene Information

Locus tag: RPC_4556
Symbol:
ID: 3971836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 5087869
End bp: 5088903
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 68%
IMG OID: 637927667
Product: hydrogenase expression/formation protein HypE
Protein accession: YP_534397
Protein GI: 90426027
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0309] Hydrogenase maturation factor
TIGRFAM ID: [TIGR02124] hydrogenase expression/formation protein HypE
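
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(5087869, 5088903)
print(length_bp)                  # 1035
print(protein_length(length_bp))  # 344
```

Both results agree with the Gene Length and Protein Length fields of this record.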


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGT TGATTGGGAC GGTGGTGACG CTGGCGCATG GCGGCGGCGG CAAGGCGATG 
CGCGACCTCG TCGCCGAAGT GTTCATGCCG GCGTTCGACA ATCCGATTCT CGGCGTCATG
GAAGATCAGG CGCGGCTCGA GCTCGGCGCA CTGTCGGTGC CTGGCGCGCG GCTCGCCTTC
ACCACCGACG GCTTCGTGGT GCGACCGCTG GAATTTCCCG GCGGCGATAT CGGCAAGCTC
GCGGTGTGCG GCACGGTGAA CGATCTCGCA GTCGGCGGCG CCACGCCGCT GTGGCTGTCC
TGCGCGGTGG TGATCGAGGA AGGTTTTGAA TTCGAAAAGC TGCGCCGCAT CGTGGCGTCG
ATGCGGGCCT CGGCGGCCGA GGCCGGCGTG CTGATCGTCA CCGGCGACAC CAAGGTGGTG
GAGCGCGGCG CGGCCGACGG ATTGTTCATT ACCACCGCCG GCGTCGGGGT GTTTCCGCCG
GGGCGCAATC TCGCCGCGGT GAATGTGCGG CCGGGCGACG TGGTGCTGGT CAATGGCCCG
ATCGGCGATC ACGGCGCCGC GGTGATGGCC GCGCGCGGCG ATCTATTGCT GGAAACCAAG
TTGATCAGCG ATTGCCGTCC GCTCAACGGC CTGATGAATG CGCTGCTTGA GGCCGCGCCC
GCCACGCGTT GCGCGCGCGA TGCGACCCGC GGCGGCGTCG CCGCGGTGCT GAACGAAATC
GCGGCGGCCG CCAAGATCGG CGTCGCGATC GACGAGGCGA AGATTCCGGT GCGTCCCGAA
GTCGACGGCG TCTGCGAAAT TTTGGGCCTC GATCCCTTGT ATCTCGCCAA TGAAGGCACC
TTGGTCGCGG TGGTGCCGGC CGATCAGGCC GACGCCGCGC TGGCGGCGTG GCGTGCCCGC
GCCGATGGAA CTGATGCTTG CGCGATCGGC GAGATCACCG CGGGGCCGTC CGGCGACGTG
GTGATGCGCA CCCGGTTCGG CAGCGAGCGA CTGGTCGATC TGATGATCGG CGATCAATTG
CCGCGGATTT GCTGA
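
The gene sequence is printed in blocks of ten bases separated by spaces, so whitespace has to be stripped before any analysis. As a minimal sketch, the hypothetical helper below (not taken from the source database) computes the GC fraction of such a blocked sequence; it is shown here on the last line of the sequence only, and running it on the full sequence should give a value comparable to the GC content field of the record.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Last line of the gene sequence as printed above (15 bases, ends in the TGA stop codon).
last_line = "CCGCGGATTT GCTGA"
print(f"{gc_content(last_line):.0%}")  # prints 60%
```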
 
Protein sequence
MSKLIGTVVT LAHGGGGKAM RDLVAEVFMP AFDNPILGVM EDQARLELGA LSVPGARLAF 
TTDGFVVRPL EFPGGDIGKL AVCGTVNDLA VGGATPLWLS CAVVIEEGFE FEKLRRIVAS
MRASAAEAGV LIVTGDTKVV ERGAADGLFI TTAGVGVFPP GRNLAAVNVR PGDVVLVNGP
IGDHGAAVMA ARGDLLLETK LISDCRPLNG LMNALLEAAP ATRCARDATR GGVAAVLNEI
AAAAKIGVAI DEAKIPVRPE VDGVCEILGL DPLYLANEGT LVAVVPADQA DAALAAWRAR
ADGTDACAIG EITAGPSGDV VMRTRFGSER LVDLMIGDQL PRIC