Gene Rsph17025_3376 details


Gene Information

Locus tag: Rsph17025_3376
Symbol:
ID: 5086036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 251541
End bp: 252647
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 66%
IMG OID: 640484944
Product: hypothetical protein
Protein accession: YP_001169561
Protein GI: 146279403
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
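
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 251541
END_BP = 252647


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the one stop codon, which is not translated.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1107
print(protein_length(length_bp))  # 368
```

Under these assumptions the computed values agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields listed in the record.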


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCCCAGA TCGAAACCTT CTATGACGTC ATGCGACGGC AGGGGATCAC CCGGCGCAGC 
TTCATGAAAT ACTGCTCGCT CACCGCGGCG GCGCTGGGGC TGGGCCCCTC GTTCGTGCCC
AAGATCGCCC ACGCGATGGA GACCAAGCCG CGGACGCCGG TGATCTGGGT CCACGGGCTC
GAGTGCACCT GCTGCTCGGA GAGCTTCATC CGCGCCGCGC ACCCGCTGGC CAAGGATGTC
GTGCTGTCGA TGATCTCGCT CGACTACGAC GACACGCTGA TGGCGGCGGC GGGGCATCAG
GCCGAAGAGG CGCTGATGGA CACGATCGAG AAATACAAGG GCAACTACAT CCTCGCGGTC
GAGGGCAACC CGCCGCTGAA CGAGGACGGG ATGTACTGCA TCATCGGCGG CAAGCCCTTC
GTGGAGCAGC TGAGGATGGC CGCCGAGCAC GCCAAGGCCA TCATCAGCTG GGGCGCCTGC
GCCTCCTACG GCTGCGTGCA GGCGGCGGCG CCCAACCCGA CGCGGGCGGT GCCGGTGCAC
AAGGTCATCC TCGACAAGCC GATCATCAAG GTGCCGGGCT GCCCGCCCAT CGCCGAGGTC
ATGACCGGCG TCATCACCTA CATGCTGACC TTCGACCGGC TGCCCGAGCT GGACCGTCAG
GGCCGCCCGG CGATGTTCTA CAGCCAGCGC ATCCACGACA AATGCTATCG CCGTCCGCAT
TTCGACGCCG GCCAGTTCGT CGAGCAATGG GACGATGACT ACGCCAAGAA GGGCTACTGC
CTTTACAAGA TGGGCTGCAA GGGTCCGACC ACCTACAACG CCTGCTCCAC CGTGCGCTGG
AACGAGGGGG TGAGCTTCCC GATCCAGTCG GGCCACGGCT GCATCGGCTG CTCGGAGGAC
GGGTTCTGGG ATCAGGGCTC GTTCTACGAC CGCGTGACCA ACATCAAGCA GTTCGGGGTC
GAGGCCAATG CGGATGCGAT CGGCCTGACG GCCGTGGGCG TCGTCGGCGC GGGCGTGGCG
CTGCATGTCG CGGCCACCGC GCTCAAGGCG GCGCAGCGCA AGTCGCAGAC CGCCAGAGAC
AAGAACAACG AGAAGACGGA GGCCTGA
 
Protein sequence
MPQIETFYDV MRRQGITRRS FMKYCSLTAA ALGLGPSFVP KIAHAMETKP RTPVIWVHGL 
ECTCCSESFI RAAHPLAKDV VLSMISLDYD DTLMAAAGHQ AEEALMDTIE KYKGNYILAV
EGNPPLNEDG MYCIIGGKPF VEQLRMAAEH AKAIISWGAC ASYGCVQAAA PNPTRAVPVH
KVILDKPIIK VPGCPPIAEV MTGVITYMLT FDRLPELDRQ GRPAMFYSQR IHDKCYRRPH
FDAGQFVEQW DDDYAKKGYC LYKMGCKGPT TYNACSTVRW NEGVSFPIQS GHGCIGCSED
GFWDQGSFYD RVTNIKQFGV EANADAIGLT AVGVVGAGVA LHVAATALKA AQRKSQTARD
KNNEKTEA