Gene Rsph17025_3379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3379 
Symbol 
ID5086039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp256546 
End bp257544 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content70% 
IMG OID640484947 
Producthypothetical protein 
Protein accessionYP_001169564 
Protein GI146279406 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATGA ACATCCTCTG GCTTCAGGCC TCGGGCTGCG GCGGCTGCAC CATGTCGCTC 
CTTTGCGCCG AGGCGCCCGG CGTCTTCGAC GCGCTGGAGG GAGCGGGCCT GCGGTTCCTC
TGGCACCCTG CCCTGTCGCT CGAGACCGGC GAGGAGGTGC GCGCCCTCTT GCGCCGGATC
GAGGCGGAAG AGATGCCGCT CGACATCCTT TGCGTCGAGG GCGCCGTCGC TCGCGGTCCG
CGCGGGACCG GGCGGTTCCA GATGCTGGCG GGCATGGGCA TCCCGATGCT CGAGGCGGTG
CGGCGGCTCG CACCGCTGGC GCGTCATGTC GTGGCGGTGG GAACCTGCGC GGCCTATGGC
GGGATGACGA GCGCGGGCGG CAACCCGTCC GATGCGGTGG GCCTGCAATA TGAGGGCGCG
CATCCCGGGG GCGCGCTGCC GTCCGCCTTC CGCGCCCGGG GCGGGCTCAA GGTGATCAAT
GTCGCAGGCT GCCCCACCCA TCCCGGCTGG GTGATCGAGA CGCTGATGAT GCTGGCGGCG
GGGGGTTTGG CGGAAGAGGC GTTCGACCGG TTCGGCCGCC CGCGCTTTTA CGCCGATCAC
CTCGTCCACC ACGGCTGTCC GAAGAACGAA TATTACGAAT ACAAGGCCAG CGCCCGCGCC
CCCGGCGAGA TCGGCTGCAT GATGGAGCAT ATGGGCTGCA TCGGCACGCA GGCGGTGGGC
GACTGCAACA TCCGGCCGTG GAACGGGGCG GGCTCCTGCA CCTCGGCCGG CTATGCCTGC
ATCGCTTGCA CCGCGCCCGA GTTCGAAGAG CCGCGCCACC CCTATTCCGA GACGCCGAAG
ATCGGCGGCA TCCCGGTGGG TCTGCCGTCG GACATGCCGA AGGCCTGGTT CATGGCGCTG
GCGAGCCTGT CAAAGGCCGC CACGCCCGAC CGCATCCGCC GCAATGCGGT GGCCGACCGC
ATCGAGGTGC CGCCGAACCT CAAGGGCCTC AAGAGATGA
 
Protein sequence
MRMNILWLQA SGCGGCTMSL LCAEAPGVFD ALEGAGLRFL WHPALSLETG EEVRALLRRI 
EAEEMPLDIL CVEGAVARGP RGTGRFQMLA GMGIPMLEAV RRLAPLARHV VAVGTCAAYG
GMTSAGGNPS DAVGLQYEGA HPGGALPSAF RARGGLKVIN VAGCPTHPGW VIETLMMLAA
GGLAEEAFDR FGRPRFYADH LVHHGCPKNE YYEYKASARA PGEIGCMMEH MGCIGTQAVG
DCNIRPWNGA GSCTSAGYAC IACTAPEFEE PRHPYSETPK IGGIPVGLPS DMPKAWFMAL
ASLSKAATPD RIRRNAVADR IEVPPNLKGL KR