Gene Dret_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1966 
Symbol 
ID8419811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2252103 
End bp2254106 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content58% 
IMG OID645038554 
Productadenylylsulfate reductase subunit alpha 
Protein accessionYP_003198828 
Protein GI258406086 
COG category[C] Energy production and conversion 
COG ID[COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit 
TIGRFAM ID[TIGR02061] adenosine phosphosulphate reductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00115626 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCTCAGA TTCCTGTGAA AGATGAGCCC AAAGGTCTGC CGCTGGCAGA ACCTGAAGTG 
GTAGAGATGG ATGTCGATGC GTTGATTGTT GGTGGCGGAA TGGGCTCCTG CGGAACGGCT
TTCGAGGCTG TTCGCTGGGC CGACAAGTAC GCCCCTGAAC TGAAGATCCT GCTGCTCGAC
AAAGCCGCTC TGGAGCGCTC CGGTGCTGTG GCCCAGGGGC TGTCGGCTAT CAATACCTAT
CTCGGCGACA ATGAACCCGA CGACTACGTC CGCATGGTCC GGACCGACCT CATGGGCATT
GTCCGCGAAG ACCTGATCTA CGACCTTGGC CGCCACGTTG ACGATTCCGT TCACCTCTTC
GAGGAATGGG GTCTGCCGTG CTGGGTCAAG AAAGACGGCA AGAACCTCGA CGGCGCCAAA
GCCAAGTCCG AGGGCTTGTC CCTGCGCACC GGCGCGACCC CGGTTCGCTC CGGCCGCTGG
CAGATGATGA TCAACGGTGA GTCCTACAAG AACATCGTTG CCGAAGCCGC CAAGAACGCC
TTGGGCGAAG ACCGCTACAT GGAGCGCATC TTCATCGTGA AGCTGCTCCT GGACGCCAAT
GAGCCTAATC GGATCGCTGG CGCCGTCGGT TTCTCCGCCC GTGAAAACAA AGTCTTCATC
TTCAAGGCCA ATGCCATCAA CGTGGCCTGC GGCGGCGCCG TGAACGTGTA CCGCCCCCGC
TCCACTGGTG AAGGCATGGG TCGCGCCTGG TATCCGGTCT GGAACGCTGG TTCCACCTAC
ACCATGGTGG CCCAGGTCGG CGGCGAAATG ACCATGATGG AAAACCGCTT CGTCCCCGCC
CGCTTCAAAG ACGGCTATGG TCCGGTCGGT GCCTGGTTCT TGCTGTTCAA GGCCAAAGCC
ACCAACGCCA AGGGCGAAGA CTACTGCCAG ACCAATGCGG CCATGCTCAA GCCGTATCAG
GATCGCGGTT ACGCCGTCGG TGCGGTTATC CCCACCTGCT TGCGGAACCA CATGATGCTG
CGTGAAATGC GCGAAGGTCG TGGCCCGATC TACATGGACA CCGCCACTGC GCTGCAGACC
ACCTTCAAGG AGCTCAACAA GCAAGAGCAG AAGCACCTTG AAAGTGAAGC CTGGGAAGAC
TTCCTCGACA TGTGCGTCGG TCAGGCCAAC CTCTGGGCGG CCATGAACAT CAAGCCTGAA
GAATCGGGCT CTGAGATCAT GCCCACTGAG CCGTACCTGC TCGGTTCCCA CTCCGGTTGC
TGCGGTATCT GGGTTTCCGG ACCCGACGAA GACTGGGTCC CTGAAGAATA CAAGATCAAG
GCTGACAACG GGAAGGTCTA CAACCGCATG ACCACCGTGA ACGGCCTGTG GACCTGCGCT
GACGGCGTTG GCGCCTCCGG ACACAAGTTC TCCTCCGGTT CCCACGCCGA AGGCCGGATC
GTTGGAAAGC AAATGGTCCG CTGGTGCGTT GATCACAAGG ATTTCAAACC GGCTCTGAAG
CAGTCTGCTG AAGAGCTGAA GAAAGAAATC TATCAGCCGT ACTACACCTA CCAGGAAAAC
AAAGACGTCT CCACGGATCC GGTGGTCAAC CCCAACTACA TCTCGCCGCG TAACTTCATG
TTCCGCCTGA CCAAGTGCAC GGACGAGTAT GGTGGTGGTT GCTCCACGTA CTACACGACT
TCGGAAGCTT TGCTGAAGAC CGGCTTTGAA CTCCTGGACA TGATGGAAGA AGACTCCAAG
CTCTTGGCTG CTCGTGACCT GCACGAACTG CTGCGTTGCT GGGAAAACTT CCACCGTCTC
TGGACTGTCC GTCTGCACAT GCAGCACATC GAATTCCGTA AGGAAAGCCG TTACCCCGGT
TTCTACTATC GTGCTGAGTA CATGGGCATT GATGACAGCA AATGGCGTTG CTTCGTCAAC
TCCAAGTACG ATCCGGAAAA GGGCGAGACG ACTCTGTTCA AGCGCCCCTA CATCCAGATC
ATTCCCGATC CGATGTCCCC GTAG
 
Protein sequence
MPQIPVKDEP KGLPLAEPEV VEMDVDALIV GGGMGSCGTA FEAVRWADKY APELKILLLD 
KAALERSGAV AQGLSAINTY LGDNEPDDYV RMVRTDLMGI VREDLIYDLG RHVDDSVHLF
EEWGLPCWVK KDGKNLDGAK AKSEGLSLRT GATPVRSGRW QMMINGESYK NIVAEAAKNA
LGEDRYMERI FIVKLLLDAN EPNRIAGAVG FSARENKVFI FKANAINVAC GGAVNVYRPR
STGEGMGRAW YPVWNAGSTY TMVAQVGGEM TMMENRFVPA RFKDGYGPVG AWFLLFKAKA
TNAKGEDYCQ TNAAMLKPYQ DRGYAVGAVI PTCLRNHMML REMREGRGPI YMDTATALQT
TFKELNKQEQ KHLESEAWED FLDMCVGQAN LWAAMNIKPE ESGSEIMPTE PYLLGSHSGC
CGIWVSGPDE DWVPEEYKIK ADNGKVYNRM TTVNGLWTCA DGVGASGHKF SSGSHAEGRI
VGKQMVRWCV DHKDFKPALK QSAEELKKEI YQPYYTYQEN KDVSTDPVVN PNYISPRNFM
FRLTKCTDEY GGGCSTYYTT SEALLKTGFE LLDMMEEDSK LLAARDLHEL LRCWENFHRL
WTVRLHMQHI EFRKESRYPG FYYRAEYMGI DDSKWRCFVN SKYDPEKGET TLFKRPYIQI
IPDPMSP