Gene Rcas_3795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3795 
Symbol 
ID5541297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4967038 
End bp4968096 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content63% 
IMG OID640895905 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001433852 
Protein GI156743723 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.717086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.875312 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAG GACTGAACTT CGAGGGATGG TCTTGTCCGT TGCCGTTGCG CGATCACCCG 
AACATCGTGA TGGGGCATGG CGGCGGCGGT AAACTCTCGG CGGAACTGGT CGAGCATCTG
TTTCTGCCAG CATTTGCGGA TTCAGGTGCA GTCGATATGG GAGATGCGGC GCTCATTGCG
GTTGGCGGGG CGCACCTGGC GTTTTCGACC GACTCGTTCG TCGTGCGGCC CCTCTTCTTT
CCCGGTGGCA ACATCGGCGA ACTGGCGGTC AACGGCACGA TTAACGACAT CGCCATGCGC
GGCGCGCAGC CGCTTGTCCT CAGCGCCGGA TTCATTCTGG AAGAAGGGCT GCCGCTCGAT
CAACTCGCCG CAATTGCGCA CAGTATGGGT GTGGCTGCGC GCCGCGCCGG TGTCACCCTT
GTGGCCGGTG ATACCAAAGT CGTCGATCGT GGGCATGGCG ACGGCGTCTA TATCAACACC
AGCGGCTTTG GCATTGTGCC GGAGGGGATC GACATTGGAC CGACGCGGGC GCAACCGGGG
GATGCGATCA TCGTCAGCGG CACGATTGGC GATCACGGCA TTGCCATTCT CAGCGTGCGC
GAAGGGCTTG AGTTTGGCGC AACCGTCGAA TCCGACACTG CGCCGCTCAA CGGGCTGGTC
GCCGATCTGC TGGACGAAAC GCGCAATATC CACGTCCTGC GTGATCCGAC GCGCGGCGGA
GTGGCGTCGG CGCTCAACGA AATCGCGCGT GCCTCACAGG TCGGTATTGT GATCGACGAG
CGTAACCTGC CGGTGCAGGA CGCTGTGCGC GCTGCATGCG AATTGCTCGG CATGGACCCG
CTCTATGTGG CGAACGAGGG GAAACTGATT GCGATTGCGC CTGCTTCCGA CGCCGAACGC
CTGTTGGCGC GCATGCGCGC GCATCCGTTG GGACGGCAGG CCGCCATCAT TGGGCGTGTC
ACCGCTGACC ATCCGGGGTT GGTGGCGGCG CGCACCGGCA TTGGCGGGAC GCGCATTGTC
GATATGATGG TTGGCGAACA GTTGCCGCGG ATTTGCTGA
 
Protein sequence
MSEGLNFEGW SCPLPLRDHP NIVMGHGGGG KLSAELVEHL FLPAFADSGA VDMGDAALIA 
VGGAHLAFST DSFVVRPLFF PGGNIGELAV NGTINDIAMR GAQPLVLSAG FILEEGLPLD
QLAAIAHSMG VAARRAGVTL VAGDTKVVDR GHGDGVYINT SGFGIVPEGI DIGPTRAQPG
DAIIVSGTIG DHGIAILSVR EGLEFGATVE SDTAPLNGLV ADLLDETRNI HVLRDPTRGG
VASALNEIAR ASQVGIVIDE RNLPVQDAVR AACELLGMDP LYVANEGKLI AIAPASDAER
LLARMRAHPL GRQAAIIGRV TADHPGLVAA RTGIGGTRIV DMMVGEQLPR IC