Gene Dret_1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1964 
Symbol 
ID8419809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2248215 
End bp2250515 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content58% 
IMG OID645038552 
Productmethyl-viologen-reducing hydrogenase delta subunit 
Protein accessionYP_003198826 
Protein GI258406084 
COG category[C] Energy production and conversion 
COG ID[COG1148] Heterodisulfide reductase, subunit A and related polyferredoxins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0030199 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGAGA AGATCGGTAT TTACTTTGAC GAGGCGAGCG TCAATGGCAT GGTCAACTTT 
GATGCCCTCG CGCAAAAGTT AGCGAACAAG TGGGGCGATA CCTGTCCCGT GGTCAAATCA
CACCCGAACC TGGCCTCTGA CGAAGGCCGG AAGCTGATTC AGGCGGATAT CGATGCGGGA
ACCATCGATG GGGTCTGTAT TTGCGGGACT TCGCCGCGGA TGGATTGGGA TCTCTACGAT
TTCGGTTCAA ACGTGGTCGT TGATCGGGTC AATCTCCGCG AAATCTGTGT CTATGCCTAC
CGCAACCCGG ACGGATCCGA ACTCGATCCC AGCGCTGAGC CGCCGGAATT GCTGCAGGAG
ATGGCCCGCG ATTATATCAA TATGGGCGTG GTCAAACTGC GCAACATGAA CATCCCCAAT
GCGGAGGAAC TCGACTCCGT CAAGCGGGTC CTTGTCCTCG GCGGCGGCTG GAGCGGATTG
ACCGCCGCTG TCAATGCGGC TCAGGCCGGC TATGAAGTCG TGGTGGTCGA AAAGGACAAG
CAGCTTGGCG GCTATGCCGG TAAAATGCAC AAGACTATCC CCTTTGCCGC GCCGTATGCG
CAGGCCCATG AGACCGGGGT CGAACAGAAG ATCAGCGCTG TTCAGGGCAA CGACAAGATT
ACGGTCTATA CCGGCAGCAA GCTCGAAAAA ATTGCCGGAC AGCCCGGGGA TTTCGAGGCC
ACCTTGAACA CCGGATCCGG TCAGGAAACG GTCAAGGTCG GCGCAGTCGT CGTGGCCACC
GGTTGGGCCG CGCAGGATGA CAAGTTCCTG GCTCCGTTCG GATATGGCTC TCTGCCCAAC
GTGGTGACCT CCATGCAGGT CGAGGAGATG GCCAAGGAAG GGACGATCAA ACGTCCCAGC
GATGGCAAGA CGCCGCAAAG CGTCCTCTTT TTGACCGGCT TTGGGGACAA GCTCGAGCAG
TTCGCCGAAG AGGAAAAGGC CCAGGCCGAA GCGGAAGCCA AGAAGGCCGA GCAGCCCAGC
GACGATGATG AACCGGCTAT TGCCGAAGAG TTCAAGAAGA CCGAATCCTA TCGGCACCTG
CCCTATACCT CGGAGATCAC CTCTCTGACC GCGTTGAAGC AGGCCCGGTA CGTTCGCGAA
ATGGCTCCGG CCTCGATGGC CTACATGGTC TATGAGCACA TGATGGTCCC TGGCGTGAAT
GAGCTCTACT ACAAGTCCGC TCAGGACGAT CCGGGCATCA TGCTGACCAA AGGTGATATC
GGGGCCATCA AGGAAGGACC GGATGGCAAA GTCCAGGTCG AGGTCTTCAA CACCCTTATT
GGCGAAGACA TCCTCCTCGA AGTGGACATG CTCGTGGTCC CGACCGCCAT GGTTCCGACG
ACGGCCCTCG AACCGGTGAT CCAATTGCAA TACCGCCAGG GCGAGGCGTT CCCGGATCTC
GAACTCTTCG ACGGCTTTGC CGATTCGAAC TATATCTGCT TTCCCTACGA GACCCGGCGG
ACAGGCATCT ATGCCGCCGG CTCCGTGCGT CAGCCCATGA CTCTGGCCAC ATCAGAGACC
GATGCCCTGG GCGCGGCGCT GAAATCGATC CAATGCATCG AATCGGTCAA CCGGGGTATG
GCTGTCCATC CGCGCTCCGG GGACATGACC TATCCGATCT TCAACTTCAC CCGGTGCACG
CAGTGTAAGC GGTGCACCGA AGAGTGCCCC TTTGGCGCCT TGGACGACGA CGAAAAGGGA
ACGCCGTTGC CGAACCTGGC CCGGTGCCGC CGTTGCGGGA CCTGCATGGG CGCTTGCCCG
GAACGTGTCA TTTCCTTTGA CAACTACGGT GTCGGACAGA TCGGCTCCAT GATCCGTGAG
ATCAACGTTC CGGATGAGAT GGAGACCGAA GGTCCCCGCA TCCTGGTTCT GGCCTGTGAA
AACGATGCCT ATCCCGCTCT GGACATGGCT GCCATGCGCG GCAAGCATTG GTCGCCGTAT
GTCCGTTTCA TCCCGGTCCG CTGCCTTGGT TCGGTGAACA CCATCTGGGT GGCTGATGCC
ATGGGCAAAG GGATCGACGG TGTCCTCCTG TTGGGCTGCA AATACGGTGA CGACTACCAA
TGCCACTTCG TCAAAGGAAG TGAGCTCTGC AGTCGCCGGA TGCAGAACAT CGGGGAAACG
CTGGATCGGC TCGGTGTGGA GAAAGAGCGC GTGGTGCAGA AGGAAGTCGC TATCGACGAC
TACAATATCG TGCCTGAGCT CATAGATTCA TTTGTGGACT ATGTGACCGC ATTGGGACCG
AATCCGTACA AGGGCTTCTA G
 
Protein sequence
MAEKIGIYFD EASVNGMVNF DALAQKLANK WGDTCPVVKS HPNLASDEGR KLIQADIDAG 
TIDGVCICGT SPRMDWDLYD FGSNVVVDRV NLREICVYAY RNPDGSELDP SAEPPELLQE
MARDYINMGV VKLRNMNIPN AEELDSVKRV LVLGGGWSGL TAAVNAAQAG YEVVVVEKDK
QLGGYAGKMH KTIPFAAPYA QAHETGVEQK ISAVQGNDKI TVYTGSKLEK IAGQPGDFEA
TLNTGSGQET VKVGAVVVAT GWAAQDDKFL APFGYGSLPN VVTSMQVEEM AKEGTIKRPS
DGKTPQSVLF LTGFGDKLEQ FAEEEKAQAE AEAKKAEQPS DDDEPAIAEE FKKTESYRHL
PYTSEITSLT ALKQARYVRE MAPASMAYMV YEHMMVPGVN ELYYKSAQDD PGIMLTKGDI
GAIKEGPDGK VQVEVFNTLI GEDILLEVDM LVVPTAMVPT TALEPVIQLQ YRQGEAFPDL
ELFDGFADSN YICFPYETRR TGIYAAGSVR QPMTLATSET DALGAALKSI QCIESVNRGM
AVHPRSGDMT YPIFNFTRCT QCKRCTEECP FGALDDDEKG TPLPNLARCR RCGTCMGACP
ERVISFDNYG VGQIGSMIRE INVPDEMETE GPRILVLACE NDAYPALDMA AMRGKHWSPY
VRFIPVRCLG SVNTIWVADA MGKGIDGVLL LGCKYGDDYQ CHFVKGSELC SRRMQNIGET
LDRLGVEKER VVQKEVAIDD YNIVPELIDS FVDYVTALGP NPYKGF