Gene Dret_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1001 
Symbol 
ID8418823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1174962 
End bp1176134 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content54% 
IMG OID645037570 
ProductSufBD protein 
Protein accessionYP_003197867 
Protein GI258405125 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0719] ABC-type transport system involved in Fe-S cluster assembly, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.36259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0114053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGAG ACGACAGAGA ACAAGTCGAT ATCAATACAT ATACGTTTGA GGGACCGGAT 
ACCGGGGGAA TCACCGATCT TCGGCATATG AGCGATGAGG ACAAGCAGCA GCTGTTGATG
TCTGGTGTCG ATGTTTCAGA GGAAGGGCGT AGCGGCACCT ATCTCCATGT CAACCAATCC
CAGGTCCACT GCTCCTCGTG CCAACCCGGC GTGGAGGTCC TGGATATCAA GCAGGCCCTT
GAAAAATACG ACGGCTTGCC CGACTACTAT TTTCAGGCCG TGGACAAGAA CAAGGACGAA
TACACGCAGG CCGTGGCTGA GAAATTGCAC GGCGGTTATT TTATCCGGAC CGAGAAAGGG
GCTAAAATCC TTGATCCGGT GCAGTCCTGT CTGTTTATCA AAGGTGAGAA TGTCGGGCAG
AATGTCCACA ACATTATCGT CGTTGAAGAG GACTCCGAAC TGCATATCAT CACCGGATGC
GCTGTTTCCC ACGGGGTGCA GAATGCATTG CACCTGGGCG TCTCCGAGTT CTATGTCAAG
AAAGGCGGGA AACTGACCTT TACCATGGTC CATAACTGGG GCGAACAGGT GAAGGTGCGG
CCGCGTACGG TGGGCATTGT CGAGGAAGAC GGGGTATTCA TGAATAATTA TGTACTCATG
AAGCCTGTCT ATTCCGTGCA GTCCTATCCG ACAGTTTACC TTAATGGGGA AAACGGTGTA
GCGACCTTCA ACTCGGTTCT TGTGGCGCCT CCGGGGTCGT ATATCAACAG CGGCAGCCGG
ATCGTTCTCA ATGCTCCGCA TACCCGCGGG GAGATCATTT CCCGGACCAT TACCACCGGC
GGGACGATTG TTGCTCCGGG GCACATCGTC GGCAATGCCG TTCCGGCCCG TGGCCATTTG
GAATGCAAGG GGCTGATTCT CGAAGACGGT GTCATTCACG CTATCCCGGA ACTCGAAGGG
TCGGTCACTG GCGTGGAACT CTCCCACGAA GCTGCGGTCG GCAAGATCGC CCAGGAGGAG
ATCGAATACC TCATGGCCCG CGGCCTTGAT GAAGACGAGG CCACATCGAC CATTGTCCGG
GGCTTTTTGA ACATGGACGT CTCCGGACTG CCGGATGAGC TGCAACAGAT GATCGAAAAG
ACGATCGAGG AAAGCGGTGA AGAAATGTTT TAA
 
Protein sequence
MRRDDREQVD INTYTFEGPD TGGITDLRHM SDEDKQQLLM SGVDVSEEGR SGTYLHVNQS 
QVHCSSCQPG VEVLDIKQAL EKYDGLPDYY FQAVDKNKDE YTQAVAEKLH GGYFIRTEKG
AKILDPVQSC LFIKGENVGQ NVHNIIVVEE DSELHIITGC AVSHGVQNAL HLGVSEFYVK
KGGKLTFTMV HNWGEQVKVR PRTVGIVEED GVFMNNYVLM KPVYSVQSYP TVYLNGENGV
ATFNSVLVAP PGSYINSGSR IVLNAPHTRG EIISRTITTG GTIVAPGHIV GNAVPARGHL
ECKGLILEDG VIHAIPELEG SVTGVELSHE AAVGKIAQEE IEYLMARGLD EDEATSTIVR
GFLNMDVSGL PDELQQMIEK TIEESGEEMF