Gene RPC_3772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3772 
Symbol 
ID3969365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4192927 
End bp4194051 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content65% 
IMG OID637926882 
Producthydrogenase (NiFe) small subunit (hydA) 
Protein accessionYP_533626 
Protein GI90425256 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.427783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCAG GGACAGAAAC ATTCTATGAG GTGATCCGCC GCCAAGGCAT CACCCGGCGC 
AGCTTCGTCA AATTCTGCAG CCTGACCGCG ACCAGCCTCG GGCTCGGCCC GATCGGCGCC
ACCGAGATCG CGCAGGCGCT GGAGACCAAG CCGCGGGTGC CGGTGATCTG GATGCACGGG
CTGGAATGCA CCTGCTGCTC GGAAAGCTTC ATCCGCTCGG CGCATCCTTT GGTCAAAGAC
GCCGTGCTGT CGATGATCTC GCTGGATTAC GACGACACCA TCATGGCGGC GGCCGGCCAT
CAGGCCGAAG CGATCCTGCA GGAGACCCGC GAGAAATACA AAGGCCAGTA CATCCTCGCG
GTGGAAGGCA ATCCGCCGCT CAACGAAGAC GGCATGTTCT GCATCGACGG CGGCCGCCCG
TTCGTCGAGA AGCTGAAGGA GATGGCCGAA GATTCCATGG CGGTGATCGC CTGGGGCGCC
TGCGCCTCCT GGGGCTGCGT GCAGGCGGCG AAGCCCAATC CGACCCAGGC CACCCCGATC
GACAAGGTGA TCCGCAACAA GCCGATCATC AAGGTGCCGG GCTGTCCGCC GATCGCCGAG
GTGATGACCG GCGTCGTCAC CTACATCACC ACTTTCGGCC GGCTGCCCGA GCTCGACCGC
CAGGGCCGGC CGAAAATGTT CTACTCGCAG CGCATCCACG ACAAATGCTA TCGCCGGCCG
CATTTCGACG CCGGCCAGTT CGTCGAAGAG TGGGACGACG ACGCCGCGCG AAAAGGCTAC
TGCCTGTACA AGATGGGCTG CAAGGGCCCG ACCACCTACA GCGCCTGTTC GACGGTGCGC
TGGAACGGCG GCGTCTCGTT CCCGATCCAA TCCGGCCACG GCTGCATCGG CTGCACCGAA
GATAATTTCT GGGACAACGG CTCGTTCTAC GACCGGCTGA CCACCATCAA GCAGTTCGGC
GTCGAGGCCA ACGCCGACAA GATCGGCGCC ACCGTAGCCG GCGTGGTCGG CACCGCGATC
GCCGCGCACG CCGCGGTCAC CACGGTGCGC AGCATGGCGA AACGTCGCAA GGAGAACGGC
GGCAACGGCA ACGGCAATAA ACCCAACGAC ACATCGGCCG GCTGA
 
Protein sequence
MGAGTETFYE VIRRQGITRR SFVKFCSLTA TSLGLGPIGA TEIAQALETK PRVPVIWMHG 
LECTCCSESF IRSAHPLVKD AVLSMISLDY DDTIMAAAGH QAEAILQETR EKYKGQYILA
VEGNPPLNED GMFCIDGGRP FVEKLKEMAE DSMAVIAWGA CASWGCVQAA KPNPTQATPI
DKVIRNKPII KVPGCPPIAE VMTGVVTYIT TFGRLPELDR QGRPKMFYSQ RIHDKCYRRP
HFDAGQFVEE WDDDAARKGY CLYKMGCKGP TTYSACSTVR WNGGVSFPIQ SGHGCIGCTE
DNFWDNGSFY DRLTTIKQFG VEANADKIGA TVAGVVGTAI AAHAAVTTVR SMAKRRKENG
GNGNGNKPND TSAG