Gene RSP_0495 details


Gene Information

Locus tag: RSP_0495
Symbol: hupS
ID: 3718446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 2231581
End bp: 2232690
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 66%
IMG OID: 640071703
Product: hydrogenase protein small subunit
Protein accession: YP_353567
Protein GI: 77464063
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
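
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The Python sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive, and that the CDS length includes the stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(2231581, 2232690)
print(length)                  # 1110
print(protein_length(length))  # 369
```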


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCCCAGA TCGAAACCTT CTACGATGTG ATGCGCCGCC AGGGGATCAC CCGGCGCAGC 
TTCATGAAAT ACTGCTCGCT CACGGCGGCG GCGCTGGGGC TCGGCCCCTC CTTCGTGCCG
AAGATCGCGC ACGCGATGGA GACGAAGCCG CGCACACCGG TGATCTGGGT CCATGGGCTC
GAATGCACCT GCTGCTCGGA GAGCTTCATC CGCGCGGCCC ATCCGCTGGC CAAGGACGTC
GTCCTGTCGA TGATCTCGCT CGACTATGAC GACACGCTGA TGGCGGCGGC GGGCCATCAG
GCCGAAGCCG CGCTGATGGA CACGATCGAG AAATACAAGG GCAACTACAT CCTTGCCGTC
GAGGGCAACC CGCCGCTGAA CGAGGACGGG ATGTATTGCA TCATCGGCGG CAAGCCCTTC
GTCGAGCAGC TGAAGATGGC GGCCGAACAT GCCAAGGCGA TCATCAGCTG GGGGGCCTGC
GCCTCCTACG GCTGCGTGCA GGCGGCCGCC CCCAACCCCA CGCGGGCCAC GCCCGTGCAC
AAGGTCATCC TCGACAAGCC GATCGTCAAG GTGCCGGGCT GCCCGCCCAT CGCCGAAGTC
ATGACCGGCG TCATCACCTA CATGCTGACC TTCGACCGGC TGCCCGAGCT CGACCGTCAG
GGCCGCCCGG CGATGTTCTA CAGCCAGCGC ATCCACGACA AATGCTACCG CCGCCCGCAT
TTCGACGCGG GCCAGTTCGT CGAGGCCTGG GACGACGACT ACGCCAAGAA GGGCTACTGC
CTCTACAAGA TGGGCTGCAA GGGGCCGACC ACCTACAACG CCTGCTCGAC CGTGCGCTGG
AACGAGGGCG TGAGCTTCCC GATCCAGTCC GGCCACGGCT GCATCGGCTG CTCGGAGGAC
GGCTTCTGGG ATCAGGGATC CTTCTACGAC CGGCTGACCA CCATCAAGCA GTTCGGCGTC
GAGGCCAATG CCGACACGAT CGGCCTCACG GCCGTGGGCG CGCTCGGCGC GGGCGTGGCG
GCCCATATCG CGGCCACCGC CCTCAAGAGC GCGCAGAAGA AATCGCAGGC GGCCAATACC
GCGAAGACAG ACGAAAAGAC GGAGGCCTGA
 
Protein sequence
MPQIETFYDV MRRQGITRRS FMKYCSLTAA ALGLGPSFVP KIAHAMETKP RTPVIWVHGL 
ECTCCSESFI RAAHPLAKDV VLSMISLDYD DTLMAAAGHQ AEAALMDTIE KYKGNYILAV
EGNPPLNEDG MYCIIGGKPF VEQLKMAAEH AKAIISWGAC ASYGCVQAAA PNPTRATPVH
KVILDKPIVK VPGCPPIAEV MTGVITYMLT FDRLPELDRQ GRPAMFYSQR IHDKCYRRPH
FDAGQFVEAW DDDYAKKGYC LYKMGCKGPT TYNACSTVRW NEGVSFPIQS GHGCIGCSED
GFWDQGSFYD RLTTIKQFGV EANADTIGLT AVGALGAGVA AHIAATALKS AQKKSQAANT
AKTDEKTEA
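
The sequences in this section are printed in blocks of ten characters. To compare the gene sequence with the Gene Length and GC content fields, the spacing has to be removed first. The sketch below is one minimal way to do that in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line of the gene sequence is used as input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence in this record.
first_line = "TTGCCCCAGA TCGAAACCTT CTACGATGTG ATGCGCCGCC AGGGGATCAC CCGGCGCAGC "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full gene sequence, `len(clean_sequence(...))` should match the Gene Length field and `gc_fraction` should be close to the GC content field, if the record is internally consistent.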