Gene Rsph17029_2146 details


Gene Information

Locus tag: Rsph17029_2146
Symbol:
ID: 4897103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2276478
End bp: 2277587
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 66%
IMG OID: 640112740
Product: hydrogenase (NiFe) small subunit HydA
Protein accession: YP_001044021
Protein GI: 126462907
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
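
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that does not count toward the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2276478
end_bp = 2277587

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the check reproduces the listed 1110 bp gene length and 369 aa protein length.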


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.206578
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.329607
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCCCAGA TCGAAACCTT CTACGATGTG ATGCGCCGCC AGGGGATCAC CCGGCGCAGC 
TTCATGAAAT ACTGCTCGCT CACCGCGGCG GCGCTGGGGC TCGGCCCTTC CTTCGTGCCG
AAGATCGCGC ATGCGATGGA GACGAAGCCC CGCACGCCGG TGATCTGGGT CCACGGGCTC
GAATGCACCT GCTGCTCGGA GAGCTTCATC CGCGCGGCCC ATCCGCTGGC CAAGGACGTC
GTCCTGTCGA TGATCTCGCT CGACTATGAC GACACGCTGA TGGCGGCGGC GGGCCATCAG
GCCGAAGCCG CGCTGATGGA CACGATCGAG AAATACAAGG GCAACTACAT CCTTGCCGTC
GAGGGCAACC CGCCGCTGAA CGAGGACGGG ATGTACTGCA TCATCGGGGG CAAGCCCTTC
GTCGAGCAGC TGAAGATGGC GGCCGAGCAC GCCAAGGCCA TCATCAGCTG GGGGGCCTGC
GCCTCCTACG GCTGCGTGCA GGCGGCCGCG CCCAACCCGA CGCGGGCCAC GCCCGTGCAC
AAGGTCATCC TCGACAAGCC GATCATCAAG GTGCCGGGCT GCCCGCCCAT CGCCGAAGTC
ATGACCGGCG TCATCACCTA CATGCTGACC TTCGACCGTC TGCCCGAGCT CGACCGTCAG
GGCCGCCCGG CGATGTTCTA CAGCCAGCGC ATCCACGACA AATGCTACCG CCGCCCGCAT
TTCGACGCGG GCCAGTTCGT CGAGGCCTGG GACGACGACT ACGCCAAGAA GGGCTACTGC
CTCTACAAGA TGGGCTGCAA GGGCCCGACC ACCTACAACG CCTGCTCGAC CGTGCGCTGG
AACGAGGGCG TGAGCTTCCC GATCCAGTCC GGCCACGGCT GCATCGGCTG CTCGGAGGAC
GGCTTCTGGG ATCAGGGGTC CTTCTACGAC CGGCTGACCA CCATCAAGCA GTTCGGCGTC
GAGGCCAATG CCGACACGAT CGGCCTCACG GCCGTGGGCG CGCTCGGCGC GGGCGTGGCG
GCCCATATCG CGGCCACCGC CCTCAAGAGC GCGCAGAAGA AATCGCAGGC GGCCAATACC
GCGAAGACAG ACGAAAAGAC GGAGGCCTGA
 
Protein sequence
MPQIETFYDV MRRQGITRRS FMKYCSLTAA ALGLGPSFVP KIAHAMETKP RTPVIWVHGL 
ECTCCSESFI RAAHPLAKDV VLSMISLDYD DTLMAAAGHQ AEAALMDTIE KYKGNYILAV
EGNPPLNEDG MYCIIGGKPF VEQLKMAAEH AKAIISWGAC ASYGCVQAAA PNPTRATPVH
KVILDKPIIK VPGCPPIAEV MTGVITYMLT FDRLPELDRQ GRPAMFYSQR IHDKCYRRPH
FDAGQFVEAW DDDYAKKGYC LYKMGCKGPT TYNACSTVRW NEGVSFPIQS GHGCIGCSED
GFWDQGSFYD RLTTIKQFGV EANADTIGLT AVGALGAGVA AHIAATALKS AQKKSQAANT
AKTDEKTEA
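
The protein sequence can be related back to the gene sequence by translating it in frame with translation table 11. The sketch below is a simplified illustration, not the tool used to produce this record: `translate_fragment` and its `is_cds_start` flag are hypothetical names, only the first and last 30 nucleotides of the gene are translated, and the first codon of the coding sequence (here TTG) is assumed to be read as methionine.

```python
# NCBI translation table 11 shares its amino-acid assignments with the
# standard code; codons are ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_fragment(dna, is_cds_start=False):
    """Translate a DNA fragment in frame, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        # Simplifying assumption: the initiator codon is read as Met (M),
        # without checking that it is a valid table 11 start codon.
        if pos == 0 and is_cds_start:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First and last 30 nucleotides of the gene sequence above.
first_30 = "TTGCCCCAGA TCGAAACCTT CTACGATGTG"
last_30 = "GCGAAGACAG ACGAAAAGAC GGAGGCCTGA"

print(translate_fragment(first_30, is_cds_start=True))  # MPQIETFYDV
print(translate_fragment(last_30))                      # AKTDEKTEA
```

The two printed fragments match the first ten and last nine residues of the protein sequence listed above.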