Gene Information |
Locus tag | Dret_0164 |
Symbol | |
ID | 8417968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 207223 |
End bp | 208188 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645036729 |
Product | hypothetical protein |
Protein accession | YP_003197044 |
Protein GI | 258404302 |
COG category | [C] Energy production and conversion |
COG ID | [COG0437] Fe-S-cluster-containing hydrogenase components 1 |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
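The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the CDS includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP = 207223
END_BP = 208188

length = gene_length_bp(START_BP, END_BP)
aa = protein_length_aa(length)
print(length, aa)  # 966 321
```

Under those assumptions the result agrees with the listed gene length of 966 bp and protein length of 321 aa.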
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0015948 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.315388 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACAA AGCTCTCGCG ACGGAATTTT TTAAAATCCC TTGGATTGGG AGCTACGGCG GCGGTAATGC CGTCGACCTT TGCCGCAGCA GCCCGGGGCG AAGAACTGGC GACCCTGCTG GATCTTTCCA AATGCGTCGG CTGCGAAAAC TGTGTCTACG CCTGCAAGGA GGTCAATCAG GACAAGTTTC CTGAGCCGGA AAAACCGTTT CCCACAATGT ATCCCAGCCG GGTTCCGGTC CAGGACTGGT CGGACAAGCG CGAGGTCAAA GACCGGCTGA CTCCCTACAA TTGGTTGTAT ATCCAGACCG CGTATGTCGA TTACGGCGGC CAGTCCTGGG AGATCCATGT CCCCCGTCGG TGTCTGCACT GCCAGAATCC GCCCTGCGCC AATCTCTGCC CCTGGGGGGC CGCCCGCAAA CAGGACAACG GGATCGTGCG GATTGACGAG GCCATTTGTC TCGGCGGATC CAAATGCAAC AAGGTCTGCC CCTGGCACAT CCCGCAGCGC CAGACCGGGG TGGGGCTGTA TCTGGATCTC TTGCCCAGCC TCGCCGGCAA CGGGGTGATG TACAAGTGCG ACCGGTGTTT CGACCGCATC GCCGAGGGGA AAGTCCCCGC CTGTATCGAA GCCTGCCCCT TTGATGTCCA GACCATTGGA CCGCGCAGTG AGATCGTGGC CGAGGCCCAT CGCCTGGCTG AAAAGATGCC GGGCTTTATC TACGGCGAGC ATGAAAACGG GGGCACGAAT ACGCTCTATG TCTCCCCCGT GCCCTTCGAT CGACTGAACG CAGCTGTGCA ACAGGGGGCC GGTCAGCCCG ATCTCGAGCC ACACCCCGAT ATGCTTTCTT CGGAAACCAA TCTGGCCAAG GCGATGCTCA TCGCCCCGGT GGCCGGTCTT GCCGCAGGGG CCCTGCACGC CGTACGCTTG GTCCAAAACG AGACCAAGGA GGACACCGAT GACTGA
|
Protein sequence | MPTKLSRRNF LKSLGLGATA AVMPSTFAAA ARGEELATLL DLSKCVGCEN CVYACKEVNQ DKFPEPEKPF PTMYPSRVPV QDWSDKREVK DRLTPYNWLY IQTAYVDYGG QSWEIHVPRR CLHCQNPPCA NLCPWGAARK QDNGIVRIDE AICLGGSKCN KVCPWHIPQR QTGVGLYLDL LPSLAGNGVM YKCDRCFDRI AEGKVPACIE ACPFDVQTIG PRSEIVAEAH RLAEKMPGFI YGEHENGGTN TLYVSPVPFD RLNAAVQQGA GQPDLEPHPD MLSSETNLAK AMLIAPVAGL AAGALHAVRL VQNETKEDTD D
|
| |
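The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline that produced the record: the `translate` helper is hypothetical, it uses the amino acid assignments of NCBI translation table 11 (which match the standard code), and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G), paired with the
# amino acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Raises KeyError on characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
prefix = "ATGCCAACAA AGCTCTCGCG ACGGAATTTT"
print(translate(prefix))  # MPTKLSRRNF
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein under the assumptions above.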