Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0541 |
Symbol | |
ID | 8418350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 650400 |
End bp | 651374 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645037106 |
Product | protein of unknown function DUF34 |
Protein accession | YP_003197416 |
Protein GI | 258404674 |
COG category | [S] Function unknown |
COG ID | [COG0327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGTCC AAGACCTCCT GGCGCACATT GAGACGGTGG CTCGGCCGGA CCTGGCGGCC TCCTGGGACC AAAGCGGGAT CCAGATTGCA GGCACGGCCC AACACATCAC CAAGATGGCC GTGACCCTCG ACGCTGTTCC GGCGACAGTC ACCGCTGCCC TCGACTGGGG GGCCGACTTC ATTCTGACCC ATCACCCGCT GAGCCTCAAA CCAACCCTCC CCAGTGCCTG CGACGCCTAT CACAGGATTC TGCGCTCGAC CCTTGAGGCC GGTGTCTGGA TGTACGCGGC CCACACAAGC CTTGACGCCC AGCCCAATGG GCCCGTGGCT TGGCTGGCGG ACGCGTTGGG CCTGCAGCAA CGAAGCGTTG TTTCACCGAG TTGTACGGAA CGGGCCCGCG TCTATCAGAT TCACGGCTTG TCCAATGGAT CCGGGCTCGA GGAACTTCCG GGCGTTTTTG ATATCCGCCA GACCGGTACC AGTGAGTGGG AGGTTACGGC CTGGCCGCAG GCTGAAGCCG CCATCAGCGC TCTGCCCGCC AGACGGGTAC ATGTTCAGGA AGCGCTGCAT CCCTGCAAGG TCTACGGTTT TGGCTGTGCA GGACGCCTTG CCACCCCGGT TTCTGCGGAA AGATTTACCG AACAGGTTGC GGCCCTTCTC AATATCAGGG GGTGGAACGA AATTGGTCTC CGCCCGGAAA CTCTCAGTAC AGTCGCCTAC TGCCCCGGTT CCGGGGCGGA TTTCGCCTCA GCCGCCTTTG GCGCCGGGGC CGACGTCTTC CTCACCGGTG ACGTCAAATA CCACCAAGCC CAACAGGTGG AATCCCTGGG GTGGACTCTG GATGTCGGCC ATTTCAGTCT CGAGGAGCAT ATGATGTCTT CCTGGGCCGA ACGCCTGAGC GAAGAACTCC TGGACGAGCA TATCGCCGTG GCCTTTTTTC CAGGGAACAA CCCCCTTACA TTGTCTACAG GATAA
|
Protein sequence | MRVQDLLAHI ETVARPDLAA SWDQSGIQIA GTAQHITKMA VTLDAVPATV TAALDWGADF ILTHHPLSLK PTLPSACDAY HRILRSTLEA GVWMYAAHTS LDAQPNGPVA WLADALGLQQ RSVVSPSCTE RARVYQIHGL SNGSGLEELP GVFDIRQTGT SEWEVTAWPQ AEAAISALPA RRVHVQEALH PCKVYGFGCA GRLATPVSAE RFTEQVAALL NIRGWNEIGL RPETLSTVAY CPGSGADFAS AAFGAGADVF LTGDVKYHQA QQVESLGWTL DVGHFSLEEH MMSSWAERLS EELLDEHIAV AFFPGNNPLT LSTG
|
| |