Gene Rpal_1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1154 
Symbol 
ID6408810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1224638 
End bp1225750 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content63% 
IMG OID642711052 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_001990169 
Protein GI192289564 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGCAG TGACGGAAAC CTTTTACGAG GTGATCAGGC GGCAGGGGAT CACGCGGCGC 
AGCTTCGTGA AATTCTGCAG TCTCACGGCC ACCAGTCTCG GCCTCGGGCC GATCGGTGCC
ACCCAGATCG CGCATGCGCT GGAGACCAAG CCGCGCGTGC CGGTGATCTG GATGCACGGG
CTGGAATGCA CCTGCTGCTC GGAAAGCTTC ATCCGCTCGG CGCATCCTCT GGTGAAGGAT
GCGGTGCTGT CGATGATCTC GCTCGATTAT GACGACACCA TCATGGCGGC GGCGGGTCAT
CAGGCCGACG CGATCCTCGA AGAGACTCGC AAGAAGTATA AAGGCCAGTA CGTGCTGGCG
GTGGAGGGCA ATCCGCCGCT GAACGAAGAC GGCATGTTCT GCATCGACGG CGGCCGCCCG
TTCGTCGAGA AACTGAAGGA AATGGCCGAA GACTCGATGG CGGTGATCGC CTGGGGGAGC
TGCGCCTCCT GGGGCTGCGT ACAGGCCGCC AAGCCCAACC CGACCAATGC CACCCCGATC
GACAAGGTGA TCCGCAACAA GCCGGTGATC AAGGTGCCGG GCTGTCCGCC GATCGCCGAA
GTCATGACCG GCGTTGTCTC CTACATCATC ACCTTTGGAC GGTTGCCCGA GCTTGACCGC
CAGGGCCGGC CGAAGATGTT CTACTCGCAG CGCATCCACG ACAAATGCTA TCGCCGGCCG
CATTTCGACG CCGGTCAGTT CGTCGAGGAA TGGGACGACG ACGGCGCGCG CAAAGGCTAC
TGCCTCTACA AGATGGGCTG CAAGGGCCCG ACTACCTACA ACGCCTGTTC GACGGTGCGC
TGGAACGGCG GCGTGTCGTT CCCGATCCAG TCCGGCCATG GTTGCATCGG CTGCTCGGAA
GACGCGTTCT GGGACAAGGG CTCGTTCTAC GACCGGCTCA CCACCATCAA TCAGTTCGGT
ATCGAGGCCA ACGCCGACAA GATCGGCGCC ACGGTCGCCG GCGTCGTCGG CACGGCGATC
GCCGCGCACG CCGCGGTGAC CACCGTGCGC AATCTGTCGC GCCGCAAGGA AGTCCCGAAC
GGCAACGGCA CCTCCAACGG CAAGTCGGCT TAA
 
Protein sequence
MGAVTETFYE VIRRQGITRR SFVKFCSLTA TSLGLGPIGA TQIAHALETK PRVPVIWMHG 
LECTCCSESF IRSAHPLVKD AVLSMISLDY DDTIMAAAGH QADAILEETR KKYKGQYVLA
VEGNPPLNED GMFCIDGGRP FVEKLKEMAE DSMAVIAWGS CASWGCVQAA KPNPTNATPI
DKVIRNKPVI KVPGCPPIAE VMTGVVSYII TFGRLPELDR QGRPKMFYSQ RIHDKCYRRP
HFDAGQFVEE WDDDGARKGY CLYKMGCKGP TTYNACSTVR WNGGVSFPIQ SGHGCIGCSE
DAFWDKGSFY DRLTTINQFG IEANADKIGA TVAGVVGTAI AAHAAVTTVR NLSRRKEVPN
GNGTSNGKSA