Gene Information |
Locus tag | Dret_0629 |
Symbol | |
ID | 8418441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 755211 |
End bp | 757148 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 645037194 |
Product | protein of unknown function DUF115 |
Protein accession | YP_003197501 |
Protein GI | 258404759 |
COG category | [S] Function unknown |
COG ID | [COG2604] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
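The coordinate and length fields in the Gene Information table are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 755211
end_bp = 757148

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp)     # 1938
print(protein_length_aa)  # 645
```

Both results match the reported Gene Length (1938 bp) and Protein Length (645 aa).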
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.822878 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000000292404 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGATG TTAAGCAAAA TGTTTTTTTT AATGCAAGGT ATAAGCGTAA TATTTCAGTA TTAAAGACAA AATTTTTTGA TTTGTATTCT AAAATATTAA ATTGCGATAG CGATAAATTG AGTATATATT CATCAAATAA TTATGTCCAA GTATTTAAAA AAAACAAACT AATATTTGAA GGAAGTATTG AAAAAAAACT TTCAAATCTT TTTCAAGATA ATTTAAAATT GACAGCCTTG AAGCCCGCTC AGCAAAGTTT GAAAGGAGAG TTTTTAAAAA GGAATCCCTT GTTTGCAAAA TATTATAATC AATTGCATTG TCTTGCTGAC GAATATGCAA ACACTGAAAA TGTTTTTCAA GTGGCAAAAA AAGATATTGG ATTAATATAT GTGTACGGTG TTATTGGCGG TCAGCAAATT ATAGACCTAT TAGAAAATTT CAATATACGC CACTTAGTGT TTATAGAAAG AAGTATAGAT ATTATTGAAG CGTCACTTTA TTTTATTGAT TGGGTTGGAG TGTTGGACTT AATTGAGCAA AAGAATACTC ATTTAAATGT TGTAATTGAT AATGATACGC AAAGGCTTTT TAGAGAAACA TTAGATACTG CATTAAAATC TAATCCTGTT TTTATTTACT CATCTATGCA TTTATTTCAA TATAAGTGTA ATGATTTTGA GTCATGTTAT AACAAACTAT CTTTAGAAAG TGGAAAACTT TTCGAAATGC GAGTTGTTGA TACGGAGCTA CGAATATGTA AAAATACGGC TTTTAATATG TCAAAACCTC GTCAAGTCTT GCAAAAAAGA TATTTTTGTA AAAAAAGCAC AAAAGATAGT AGTATTTTTG CGGCGGCAAT CGTAGGTTCT GGTCCTTCTT TAGATGAAAG TATAAAATAT TTGAAAGATC ATAAAAATAA TTTTACAATT TTTTCTTGCG GATCTTCTCT TCTTTCACTC TATGAAAATA ATGTTTTGCC TGACTATCAT GTCGATATTG AGCGTCAGCA CAATATTCAG CCCTATATTA ATTATATTGA TAAAGAATAT TTAAACAATG TTCATGTATT GTATCCAATC TATTTCAAAG AAAGATATGA GTATGCCGAT TTGTTTAAAT CGAAAACATA CTTTCATGTA CAAAATAACT ATGATTTGTT TTTTGCTAAT GTGGCCGAGG GCGATTTGCT TTTAAGCGGA GGATTGACAG TAACGAGTTA TGCTTTAGAA CTCGCCTTGC AAGCAACTTT TAATGTTGTT TACCTGTTTG GGTGTGACTT GGGGTATTTT AATACTCAAA ATCATCATGC TAAGGGGCAT GTTCATTACA AATATACATA TAAATATGAT GCGATTAATA TGCAGGTAGT ACCCGGTAAT TTTAAAAACC ATGTTTGCTC TGACGGAGTT CTTTACCATG AAATAGAAAC TTATGAAACT ATTTTAAAAA ATTACAATAA ATCATGTGTT TTGAATACAA GTAATGGCGC AAAGGTGTGT AATACTTTCC CTATTCCATC TAAATATATA TATACAAATT ATTATATTAA AAATAACTTA TATAGTATTA AACGCACAAA TGAAATTAAT GATCAAGTTT TTTTTGATTT ATGTTATTTA TATTATGCGC TGATAAAAGA TTTTAAAAGT ATGCTGTTTC AGAAAATTTA TAGTTTCAGT ACATTCATCG ATTTATATAT GGATTTGCAT TCTTATTTGC AAAATAATTT GCAGGATAAT TCACCTGCTT TGTTTAATAT TATCAATTAT ACGCTATTGC AGTATATGCA TATTGTTTTT 
TCTAATGCGT ATAGCATTGT TGACGAAGAT TTAACGATAG AATTTGTGGA AAAGGCTAAC CGTATACTGT ATGACTTTTT GTGTGATCTT GAGCCCGAGC TCGCTGAGTT AGCAGAGGCA GTTAATGAAG CGCGCTAG
|
Protein sequence | MIDVKQNVFF NARYKRNISV LKTKFFDLYS KILNCDSDKL SIYSSNNYVQ VFKKNKLIFE GSIEKKLSNL FQDNLKLTAL KPAQQSLKGE FLKRNPLFAK YYNQLHCLAD EYANTENVFQ VAKKDIGLIY VYGVIGGQQI IDLLENFNIR HLVFIERSID IIEASLYFID WVGVLDLIEQ KNTHLNVVID NDTQRLFRET LDTALKSNPV FIYSSMHLFQ YKCNDFESCY NKLSLESGKL FEMRVVDTEL RICKNTAFNM SKPRQVLQKR YFCKKSTKDS SIFAAAIVGS GPSLDESIKY LKDHKNNFTI FSCGSSLLSL YENNVLPDYH VDIERQHNIQ PYINYIDKEY LNNVHVLYPI YFKERYEYAD LFKSKTYFHV QNNYDLFFAN VAEGDLLLSG GLTVTSYALE LALQATFNVV YLFGCDLGYF NTQNHHAKGH VHYKYTYKYD AINMQVVPGN FKNHVCSDGV LYHEIETYET ILKNYNKSCV LNTSNGAKVC NTFPIPSKYI YTNYYIKNNL YSIKRTNEIN DQVFFDLCYL YYALIKDFKS MLFQKIYSFS TFIDLYMDLH SYLQNNLQDN SPALFNIINY TLLQYMHIVF SNAYSIVDED LTIEFVEKAN RILYDFLCDL EPELAELAEA VNEAR
|
| |
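The sequences above are displayed in space-separated groups of ten characters. As a minimal sketch of how such a field might be processed, the Python below removes the grouping from the first three groups of the gene sequence, counts G and C bases in that fragment only, and translates it in frame 1. It relies on the fact that translation table 11 assigns the same amino acids as the standard genetic code; the helper name `translate` and the choice of fragment are illustrative assumptions.

```python
# First three 10-base groups of the gene sequence shown above.
grouped = "ATGATCGATG TTAAGCAAAA TGTTTTTTTT"

# Remove the display spacing to obtain a contiguous sequence.
seq = "".join(grouped.split()).upper()

# GC count and fraction of this fragment only (not of the whole gene).
gc_count = sum(base in "GC" for base in seq)
gc_fraction = gc_count / len(seq)

# Codon table in NCBI layout (bases ordered T, C, A, G). Translation table 11
# uses the same amino-acid assignments as the standard code; it differs only
# in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(len(seq), gc_count)  # 30 7
print(translate(seq))      # MIDVKQNVFF
```

The translated fragment matches the first ten residues of the listed protein sequence. Running the same steps on the full gene sequence would require joining its lines first.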