Gene Rcas_3796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3796 
Symbol 
ID5541298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4968093 
End bp4969208 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content62% 
IMG OID640895906 
Producthydrogenase expression/formation protein HypD 
Protein accessionYP_001433853 
Protein GI156743724 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.44763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.895026 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTACC TGGACGAATA TCGCGACCCG GATCTGGCGC AGCGGCTTTT TGCCGAGATT 
CGTCGCATCA CGACGCGCCC CTGGGCGATC ATGGAAGTGT GCGGCGGGCA GACGCATTCG
ATTATCCGCA ACGGAATCGA TCAACTGTTG CCAGAAGCGA TTGAGCTGAT CCACGGTCCT
GGCTGCCCGG TGTGCGTGAC GCCGCTGGAG ATTATCGACA AGGCGCTGGC AATCGCTTCC
CGTCCCAACG TTATTTTCTG TTCGTTTGGC GATATGCTGC GCGTGCCGGG GAGTGCGAAA
GACCTGTTCC GCGTCAAGAG CGAAGGCGGT GATGTGCGTG TGGTGTATTC ACCGCTCGAT
GCCGTGCGCC TGGCGCAGCA GCACCCCGAC CGTGAGGTGG TCTTCTTTGG CATTGGTTTC
GAGACGACTG CTCCCGCCAA CGCAATGGCA GTGTTGCAGG CGCACCGCCT GGGATTGCGC
AATTTTTCGA TGCTGGTGTC GCACGTGCTG GTGCCACCCG CCATCTCCGC CATCATGGAG
TCGCCGACGA ACCGTGTGCA AGGATTCCTG GCAGCCGGGC ATGTGTGCAG CGTGATGGGC
ACCTGGCAGT ACCGGCCGCT GGTTGAACGG TACCATGTGC CAATTGTTGT CACCGGTTTC
GAGCCGCTCG ACGTACTGGA AGGGATCCGC CGCGTCGTTC TGCAACTGGA AGCGGGACGC
GCCGAACTCG ACAACGCCTA TGAGCGTGCC GTGCGACCGG AAGGGAACGT CGCGGCGCAA
CAGGTGCTGT CTGAGGTCTT TGAGGTGACC GACCGGGCAT GGCGCGGCAT CGGCGTTATC
CCGCAGAGCG GTTGGCGCCT GCGCAACGCC TACCGCGCCT ACGACGCCGA GGCGCGCTTC
GCGGTTGGCG ACATTCAGAC GCGCGAGTCG CCAATCTGCC GCAGCGGTGA GGTGCTTCAG
GGTATGCTCA AGCCCAATCA GTGCCCGGCG TTCGGCAAGG AATGCACCCC GCGCACGCCC
CTTGGCGCAA CGATGGTGTC GAGCGAAGGG GCATGCGCTG CGTACTATCA GTACGGACGG
TTCATCAAGG CTGAAGAGGT GGGAGTCGCG CGATGA
 
Protein sequence
MKYLDEYRDP DLAQRLFAEI RRITTRPWAI MEVCGGQTHS IIRNGIDQLL PEAIELIHGP 
GCPVCVTPLE IIDKALAIAS RPNVIFCSFG DMLRVPGSAK DLFRVKSEGG DVRVVYSPLD
AVRLAQQHPD REVVFFGIGF ETTAPANAMA VLQAHRLGLR NFSMLVSHVL VPPAISAIME
SPTNRVQGFL AAGHVCSVMG TWQYRPLVER YHVPIVVTGF EPLDVLEGIR RVVLQLEAGR
AELDNAYERA VRPEGNVAAQ QVLSEVFEVT DRAWRGIGVI PQSGWRLRNA YRAYDAEARF
AVGDIQTRES PICRSGEVLQ GMLKPNQCPA FGKECTPRTP LGATMVSSEG ACAAYYQYGR
FIKAEEVGVA R