Gene Dret_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0422 
Symbol 
ID8418227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp517549 
End bp518616 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content63% 
IMG OID645036983 
Productcoenzyme F420 hydrogenase/dehydrogenase beta subunit domain protein 
Protein accessionYP_003197297 
Protein GI258404555 
COG category[C] Energy production and conversion 
COG ID[COG1035] Coenzyme F420-reducing hydrogenase, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.664623 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCCT TTTTCGACTT GCTCAGAGAC ATCATCCACC CTGGATATTG CCGCCGTTGC 
GGCGGGTGTG TCGCGTTTTG CCAGGCCAAC AATTACGGGG CCTTGGAAAC AAGTCCCGAG
GGGTGGCCGC AATTTCGCAA CGCGGAACGG TGCCTGGAAT GCGGCGTCTG TTATCAGATC
TGCCCGGCCA CGGGACAGTT GACCGCAGAA ACCCGCCGTC GCGTGGCCTG GTCAGCTCCC
ATCGGCCGGG TCATGGACAG TGGCGTATTT CGCAGCACTG CCTCCTCCGC TGCCACGCAC
CCCGACGGCA AAGTCGGACT GGGGCTTATC GAACATCTGT TTGCCACCGA CCGGATTGAC
GGCGCCGTGG TTTTGCCCAC GGACCATGTC TTCAACAAAG CCTCCCAACT GGCCAGGACC
CCGGACCAGC TCCGGGCCCT GGACGACAAA GGCCTGACCA CGTCCCAGGT CCGGGCGGCG
GCCTCGTCAC TCGAGGTTTT GGGGTCAGTG CGCAAAACCG GGCTGCGCCG GGTCGCCTTC
ATGGGCACCC CCTGCCAGGT GGAAACAGTG CGCAAAATGG AAATCCTGGC CGTGCCTCCT
GCCGAGCGGC TCTATTGCAC CATCGGGTGC TTTTGCGACG GCGACTTTCT GCTCGGTCCC
GCCCAGCAGG ACCGCTTGGA AAAACTCGGC TTCTTCCGCT GGAACGATGT GCGCCATATC
ACGTTGCGCG ACCATCTGGA ATTCTGGCTC GCCGAGGGCA AAACCCGGGA CATCCCCATG
GAAGACCTGG ATTTCATGCG CCGCTACGCC TGCCGCTTGT GTACTGACTA TTCCGCACAA
TTTGCGGACC TCGCCGTCGG CGGCACGGGG GCCCCGCATG GATGGAGTAC GGTCATCGCC
CGAACGCCCC TGGGCCAGGC CATTCTCAGT GAAGCCAGGC AGACCGCATT GGAACCATTC
AACGGGACCG CAGCTTCCCA GACCCGGCAC AACGCCTTGT ACACCGTCTG CCGGCTGGCG
GAGCGAAAAC AGCACCGGGC CCTGCACAGC AGCAGGGCGG CAATGTGA
 
Protein sequence
MASFFDLLRD IIHPGYCRRC GGCVAFCQAN NYGALETSPE GWPQFRNAER CLECGVCYQI 
CPATGQLTAE TRRRVAWSAP IGRVMDSGVF RSTASSAATH PDGKVGLGLI EHLFATDRID
GAVVLPTDHV FNKASQLART PDQLRALDDK GLTTSQVRAA ASSLEVLGSV RKTGLRRVAF
MGTPCQVETV RKMEILAVPP AERLYCTIGC FCDGDFLLGP AQQDRLEKLG FFRWNDVRHI
TLRDHLEFWL AEGKTRDIPM EDLDFMRRYA CRLCTDYSAQ FADLAVGGTG APHGWSTVIA
RTPLGQAILS EARQTALEPF NGTAASQTRH NALYTVCRLA ERKQHRALHS SRAAM