Gene RPC_3759 details


Gene Information

Locus tag: RPC_3759
Symbol:
ID: 3969352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4183229
End bp: 4184359
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 67%
IMG OID: 637926869
Product: hydrogenase expression/formation protein HypD
Protein accession: YP_533613
Protein GI: 90425243
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0409] Hydrogenase maturation factor
TIGRFAM ID: [TIGR00075] hydrogenase expression/formation protein HypD
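
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of Python. This is a minimal sketch, not part of the record itself. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4183229, 4184359)
print(length_bp)                  # 1131
print(protein_length(length_bp))  # 376
```

Both results match the Gene Length (1131 bp) and Protein Length (376 aa) fields reported for RPC_3759.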


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATATG TCGACGAGTA TCGCGACGGC CAACTCGCCC GAGGCCTCGC CGCCACCATC 
GCCCGCGACA GCGATCCCGC GCGCAACTAC GCGCTGATGG AATTCTGCGG CGGCCACACC
CACGCGATCT CGCGCTACGG CCTCGAGGAT CTGTTGCCGG GCAACGTTCG CATGGTGCAC
GGTCCCGGCT GCCCGGTCTG CGTGCTGCCG ATCGGCCGCA TCGACATGGC GCTGCAGCTC
GCGACCCGGC CGAACGTGAC GCTGTGTTGT TACGGCGACC TGATGCGGGT GCCGGGCTCG
CGTGGCAACT CGCTGCTGCG CGCCAAGGCG GCGGGCGCCG ACATCCGCAT GGTGTATTCG
ACGCTCGATG CGCTCGCGCT CGCCGAGGCC GAGCCGTCGC GCGACGTGGT GTTCTTCGCC
ATCGGTTTCG AGACCACCAC GCCGCCGACC GCGCTGGCGG TCCGGCTGGC GCAGAAGCGG
GGCCTCACCA ATTTCAGCGT GTTCTGCAAC CACGTGCTGA CGCCGTCGGC GATCCAGCGG
ATTCTCGCTT GCGACAGCGA CGGTGTGCAC ATCGATGGCC TGGTCGGTCC GGCGCATGTC
TCCACCGTGA TCGGCACCGC GCCGTTCTCA CGCTTCGCAA CCGAGTTCGC CAAGCCGGTG
GTGGTGGCGG GCTTCGAGCC GCTCGACGTG ATGCAGGCGA TATTGATGCT GATCCGCCAG
GTCAATGACG GCCGCGCCGA GGTGGAAAAC CAGTACATCC GCGCGGTGAC GCCCGACGGC
AACCGGATCG CGCAGGGCGA AGTCGCGGAT ATCTTCGAAT TGCGTGAGAG TTTCGAGTGG
CGCGGGCTCG GCCAGATCCC CGCCAGCGCG CTGCGCTTGA AACCAGCCTA TGCCGGCTTC
GACGCCGAGC GGCGCTTTGC GCTCGACGAT ATGTCCGCCA GTGACAACCC GGCCTGCGAA
TGCGGCGCCA TCCTGCGCGG CGTCAAGCGC CCCGCCGAAT GCCGATTGTT CGGCAAAGCA
TGCACGCCGG AGAGTCCAAT GGGCTCCTGC ATGGTGTCGT CGGAAGGCGC CTGCGCGGCG
CATTGGAGCT ATGGCCGGTT TCGCGACCAC GCCCGGAGGA AAACCGCATG A
 
Protein sequence
MKYVDEYRDG QLARGLAATI ARDSDPARNY ALMEFCGGHT HAISRYGLED LLPGNVRMVH 
GPGCPVCVLP IGRIDMALQL ATRPNVTLCC YGDLMRVPGS RGNSLLRAKA AGADIRMVYS
TLDALALAEA EPSRDVVFFA IGFETTTPPT ALAVRLAQKR GLTNFSVFCN HVLTPSAIQR
ILACDSDGVH IDGLVGPAHV STVIGTAPFS RFATEFAKPV VVAGFEPLDV MQAILMLIRQ
VNDGRAEVEN QYIRAVTPDG NRIAQGEVAD IFELRESFEW RGLGQIPASA LRLKPAYAGF
DAERRFALDD MSASDNPACE CGAILRGVKR PAECRLFGKA CTPESPMGSC MVSSEGACAA
HWSYGRFRDH ARRKTA
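
The gene and protein sequences above can be cross-checked by translating the nucleotide rows codon by codon. The following is a minimal sketch under stated assumptions: it builds the codon table by hand rather than relying on any library, and it uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in their permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last rows of the gene sequence are translated here; both rows begin on a codon boundary because every full row holds 60 bases.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide string; trailing partial codons are ignored."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGAAATATG TCGACGAGTA TCGCGACGGC CAACTCGCCC GAGGCCTCGC CGCCACCATC"
last_row = "CATTGGAGCT ATGGCCGGTT TCGCGACCAC GCCCGGAGGA AAACCGCATG A"

print(translate(first_row))  # MKYVDEYRDGQLARGLAATI
print(translate(last_row))   # HWSYGRFRDHARRKTA*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final 16 residues followed by the TGA stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.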