Gene Dret_0164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0164 
Symbol 
ID8417968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp207223 
End bp208188 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content60% 
IMG OID645036729 
Producthypothetical protein 
Protein accessionYP_003197044 
Protein GI258404302 
COG category[C] Energy production and conversion 
COG ID[COG0437] Fe-S-cluster-containing hydrogenase components 1 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0015948 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.315388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACAA AGCTCTCGCG ACGGAATTTT TTAAAATCCC TTGGATTGGG AGCTACGGCG 
GCGGTAATGC CGTCGACCTT TGCCGCAGCA GCCCGGGGCG AAGAACTGGC GACCCTGCTG
GATCTTTCCA AATGCGTCGG CTGCGAAAAC TGTGTCTACG CCTGCAAGGA GGTCAATCAG
GACAAGTTTC CTGAGCCGGA AAAACCGTTT CCCACAATGT ATCCCAGCCG GGTTCCGGTC
CAGGACTGGT CGGACAAGCG CGAGGTCAAA GACCGGCTGA CTCCCTACAA TTGGTTGTAT
ATCCAGACCG CGTATGTCGA TTACGGCGGC CAGTCCTGGG AGATCCATGT CCCCCGTCGG
TGTCTGCACT GCCAGAATCC GCCCTGCGCC AATCTCTGCC CCTGGGGGGC CGCCCGCAAA
CAGGACAACG GGATCGTGCG GATTGACGAG GCCATTTGTC TCGGCGGATC CAAATGCAAC
AAGGTCTGCC CCTGGCACAT CCCGCAGCGC CAGACCGGGG TGGGGCTGTA TCTGGATCTC
TTGCCCAGCC TCGCCGGCAA CGGGGTGATG TACAAGTGCG ACCGGTGTTT CGACCGCATC
GCCGAGGGGA AAGTCCCCGC CTGTATCGAA GCCTGCCCCT TTGATGTCCA GACCATTGGA
CCGCGCAGTG AGATCGTGGC CGAGGCCCAT CGCCTGGCTG AAAAGATGCC GGGCTTTATC
TACGGCGAGC ATGAAAACGG GGGCACGAAT ACGCTCTATG TCTCCCCCGT GCCCTTCGAT
CGACTGAACG CAGCTGTGCA ACAGGGGGCC GGTCAGCCCG ATCTCGAGCC ACACCCCGAT
ATGCTTTCTT CGGAAACCAA TCTGGCCAAG GCGATGCTCA TCGCCCCGGT GGCCGGTCTT
GCCGCAGGGG CCCTGCACGC CGTACGCTTG GTCCAAAACG AGACCAAGGA GGACACCGAT
GACTGA
 
Protein sequence
MPTKLSRRNF LKSLGLGATA AVMPSTFAAA ARGEELATLL DLSKCVGCEN CVYACKEVNQ 
DKFPEPEKPF PTMYPSRVPV QDWSDKREVK DRLTPYNWLY IQTAYVDYGG QSWEIHVPRR
CLHCQNPPCA NLCPWGAARK QDNGIVRIDE AICLGGSKCN KVCPWHIPQR QTGVGLYLDL
LPSLAGNGVM YKCDRCFDRI AEGKVPACIE ACPFDVQTIG PRSEIVAEAH RLAEKMPGFI
YGEHENGGTN TLYVSPVPFD RLNAAVQQGA GQPDLEPHPD MLSSETNLAK AMLIAPVAGL
AAGALHAVRL VQNETKEDTD D