Gene Dret_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1077 
Symbol 
ID8418902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1268647 
End bp1269657 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content59% 
IMG OID645037649 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_003197943 
Protein GI258405201 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.26154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.1511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTTTC CAGATACTGT GCTCCTTGAT TACGGCAGCG GGGGCAAAGC CTCCCAGCGG 
CTTATCAGCG AACTTTTTTT GAAACATTTT GACAATGCCA CCCTCAACCG GCTGGATGAC
GCGGCAATGC TGGACCTCAG CGGACCGCTT GCCGTGAGTA CCGACAGTTT TACCGTGGAC
CCGTTGTTCT TTCCTGGTGG TGATATCGGC TCTCTTGCCA TCCACGGCAC GGTCAATGAC
GTGGCCATGC TCGGGGCGCG GCCGATGTAT CTCAGTTGCG CGATGATCGT GGAAGAGGGA
TTGCCGTTTT CCACTTTGGA AGCCGTGGTT CGGTCTATGG CTGAAGCGTC CCGACATGCC
GGGGCCCAGA TCGTTACCGG GGACACCAAG GTCGTCCCCA AAGGGGCTGT GGATAAGCTT
TTTATCAATA CGACCGGGTT GGGACTGGTC CAGACGGCTT CCCCGCCCCA GGGCGACAGG
GCCCGCCCAG GCGACGCGAT CCTGCTGACC GGGACAATGG GTGACCACGG TCTGACGATT
TTAAGCCAGC GCCAGGGACT GGAATTCGAG ACTCCGGTGC AAAGCGATAG CGCCGCACTC
AATCATATGC TGCTTGATCT GGTTGAATCG GTGGGAGAAG TCCATGTCTT GCGCGACCCG
ACGCGCGGCG GCCTGGCGAC CACACTCAAC GAGATCGCAC TCCAATCCAA TCTGGGATTC
GTGATAGAGG AAAAGGCGGT TCCGGTCTCG GATGCGGTGC GTTCCGGTTG CTCGTTTCTG
GGACTTGACC CCTTGTATCT GGCCAATGAG GGCAAGGCCA TCTGCATTGT CCCTGAAGAT
CGCTTGGATG CGGCCTTGGC TTGCCTTCGC TCCCACGACG AAGGCCGCCA GGCCTGCCGA
GTCGGCACGG TGACCGAAGA CCATCCTGGA AAGGTGGTCT TGCAGACCCC GATCGGCGGC
AAACGCTTGC TGGATATGCT TGAAGGAGAG CAATTACCGA GGATTTGCTG A
 
Protein sequence
MSFPDTVLLD YGSGGKASQR LISELFLKHF DNATLNRLDD AAMLDLSGPL AVSTDSFTVD 
PLFFPGGDIG SLAIHGTVND VAMLGARPMY LSCAMIVEEG LPFSTLEAVV RSMAEASRHA
GAQIVTGDTK VVPKGAVDKL FINTTGLGLV QTASPPQGDR ARPGDAILLT GTMGDHGLTI
LSQRQGLEFE TPVQSDSAAL NHMLLDLVES VGEVHVLRDP TRGGLATTLN EIALQSNLGF
VIEEKAVPVS DAVRSGCSFL GLDPLYLANE GKAICIVPED RLDAALACLR SHDEGRQACR
VGTVTEDHPG KVVLQTPIGG KRLLDMLEGE QLPRIC