Gene Rsph17029_2161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2161 
Symbol 
ID4896222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2288920 
End bp2289942 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content73% 
IMG OID640112755 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001044036 
Protein GI126462922 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.072502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.073432 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTCA GGGACAGCCA TGTGACGCTC GCCCACGGCG GCGGCGGGAA AGCCATGCGC 
GATCTGATCG AGGAGGTGTT CACGAGCCTC TTCCAGCCGC CGGGGATGGA GGATCAGGCG
CGGCTGACCT CGGCGGCGCT GGCCGCGCCG GGCGCGCGGC TCGCGCTCAC CACCGACAGT
TTCGTCGTGA CCCCGCTCGA ATTTCCCGGC GGCGACATCG GCAAGCTCGC CATCTGCGGC
ACGGTCAACG ATCTCGCGGT GGGCGGGGCA GAGCCGCTCT GGCTCTCGGC CGCCTTCATC
ATCGAGGAGG GCACCGAGAT CGCGCTGCTG CGCCGGATCG CGGCCACCAT GGCGGACGAG
GCCCGGGCGG CCGGCGTGCG GATCGTGACG GGCGACACGA AGGTGGTGGA ACGTGGCGCG
GCCGACGGGC TCTTCATCAC CACGACCGGC GTGGGCGTGA TCCCGCCCGG GCGCGAGCTG
TCGGCCGCGG CGATCCGGCC GGGCGACCGG CTGCTCGTGA ACGGGGGCCT CGGCGATCAC
GGCGCCACCA TCCTCGCCGC GCGCGGGGAT CTGGCGCTCT CGACCGATCT CCAGTCGGAC
TGCGCCGCCC TCGGGCATCT GATGACGGCC GTGCTCAAGG CCGCTCCCGG TGCCCGGGCC
GCACGGGATG CGACCCGCGG CGGGGTCGCG GCGGTGCTGA ACGAGATGGC CGAGGCCTCG
GGCGTGGGGC TCGTCATCGA GGAGGAGGCG CTGCCGCTGC GGGCCGAGGT CGTGGGTCTT
TGCGAGATCC TCGGCCTCGA TCCGCTTTAT CTCGCCAACG AGGGGCGGCT CGTGGTCGTG
GTGCCGGAGG CGGAGGCCGA GGCGGCCCTC GGGGCCATGC GAGCCTGCCC CGAGGGCGCG
GGCGCGGTGG CCATCGGCCG CGCGGTCGCG GACCATCCGG GGCAGGTGCG CATGACCACC
CGCTTCGGCG GCAGCCGGAT CGTCGACATG CTGGTGGGCG AGCAACTGCC CCGCATTTGC
TGA
 
Protein sequence
MALRDSHVTL AHGGGGKAMR DLIEEVFTSL FQPPGMEDQA RLTSAALAAP GARLALTTDS 
FVVTPLEFPG GDIGKLAICG TVNDLAVGGA EPLWLSAAFI IEEGTEIALL RRIAATMADE
ARAAGVRIVT GDTKVVERGA ADGLFITTTG VGVIPPGREL SAAAIRPGDR LLVNGGLGDH
GATILAARGD LALSTDLQSD CAALGHLMTA VLKAAPGARA ARDATRGGVA AVLNEMAEAS
GVGLVIEEEA LPLRAEVVGL CEILGLDPLY LANEGRLVVV VPEAEAEAAL GAMRACPEGA
GAVAIGRAVA DHPGQVRMTT RFGGSRIVDM LVGEQLPRIC