Gene Information |
Locus tag | Dret_2469 |
Symbol | |
ID | 8420331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 2829061 |
End bp | 2830227 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 645039072 |
Product | protein of unknown function DUF39 |
Protein accession | YP_003199329 |
Protein GI | 258406587 |
COG category | [S] Function unknown |
COG ID | [COG1900] Uncharacterized conserved protein |
TIGRFAM ID | |
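
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2829061
end_bp = 2830227
listed_gene_length_bp = 1167
listed_protein_length_aa = 388

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)      # 1167
print(protein_length_aa)   # 388
print(gene_length_bp == listed_gene_length_bp)        # True
print(protein_length_aa == listed_protein_length_aa)  # True
```

The strand value (-) does not affect these lengths; it only indicates that the gene is read from the reverse strand of the replicon.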
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000655672 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000229248 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGACGC AAGTGACCAA AACGTTGCAA GAAATCAACG CGCGAATCGA GAAGGGCAAG GCTGTAGTGC TCAATGCCCG GGAAATGAGC AAACTAGTCC GGAGTGAGGG GAAGGTCAAG GCAGCCAAAG AGGTTGACGT GGTCACTACC GGAACATTTT CGCCGATGTG TTCTTCTGGC ATGGTCTTTA ATATCGGGCA GCAACCGCCG ACGATGAAGG TTTCCCGGAT GTGGCTCAAC GGCGTGCCTG TCCATTGCGG CCTGGCGGCA GTAGACGCCT TTCTCGGCGC CACTGAACCA GCCGAGGACG ACCCACTCAA CAAAGTGTAT CCCGGGCAAT TCAAGTACGG CGGTGGCCAT GTCATCGAGG ATTTGATCCG TGGCAAGACT GTTCATATGA AGGCCGAAGC CTATGGAACG GACTGCTATC CCCGCACAGC GCTGGAAAAG GATGTCACCC TCTCGAGTTT CAAAAATTGC TGGATGTTCA ACCCGCGCAA CGCCTATCAA AACTACAATT GTGCCGTGAA TCTGACCAGC CGGACCAAGT ACACCTACAT GGGACCCTTG AAGCCGAATC TGGGCAATGC AAACTTTGCC ACAGCCGGAG AACTCAGTCC CCTGTTCAAC GATCCCTACT TCAGGACCAT TGGGCTGGGG ACCAGGATAT TTCTCGGGGG CTCGGTGGGC TATGTCCTTG GTTCAGGGAC CCAGCACAAT CCAAAGCCGT CGCGCAATGA GCGCGGTATT CCCCTGACCC CGTCCGGAAC CCTTATGGTC AAGGGCGAAA TGGCGGGCAT GGATCCCCGT TTTGCCCGTG GTGTGAGTAT TGTCGGTTAT GGGTGCTCCC TGTCGGTCGG CCTCGGAATT CCTATTCCCG TGCTCAATGA GGATATTGCC TGGTATACCG GGGTATCGAA CAGTGAGATC CACATGCCGA TCGTTGATTA CGGCCACGAT TATCCCCAGG GCGTGGGCCG GATTCTGCAG CACGCCAGCT TTGAGGAATT GTTAGGGGGG GAAATTCAAG TCAACGGCAA GAGTGTTCCC ACCGTCCCTG TCACCAGCGC CACGTATTCG CTGGAAGTGG CCGATACCTT GAAGTCGTGG ATTGAAAAGG GCGAGTTCCT GTTGACTGAG CCCCAGGAGA AGATCTGCTC TGAGTGA
|
Protein sequence | MATQVTKTLQ EINARIEKGK AVVLNAREMS KLVRSEGKVK AAKEVDVVTT GTFSPMCSSG MVFNIGQQPP TMKVSRMWLN GVPVHCGLAA VDAFLGATEP AEDDPLNKVY PGQFKYGGGH VIEDLIRGKT VHMKAEAYGT DCYPRTALEK DVTLSSFKNC WMFNPRNAYQ NYNCAVNLTS RTKYTYMGPL KPNLGNANFA TAGELSPLFN DPYFRTIGLG TRIFLGGSVG YVLGSGTQHN PKPSRNERGI PLTPSGTLMV KGEMAGMDPR FARGVSIVGY GCSLSVGLGI PIPVLNEDIA WYTGVSNSEI HMPIVDYGHD YPQGVGRILQ HASFEELLGG EIQVNGKSVP TVPVTSATYS LEVADTLKSW IEKGEFLLTE PQEKICSE
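
The sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be stripped before a length or GC content can be computed from them. The following is a minimal sketch of that clean-up; the helper names `clean_sequence` and `gc_fraction` are hypothetical and chosen for this example, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three ten-base blocks of the gene sequence shown above.
sample = clean_sequence("ATGGCGACGC AAGTGACCAA AACGTTGCAA")
print(len(sample))          # 30
print(gc_fraction(sample))  # GC fraction of this 30-base sample only
```

Passing the full gene sequence through the same two helpers would give a value that can be compared with the Gene Length and GC content fields listed in the Gene Information table.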