Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2949 |
Symbol | |
ID | 8385258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 3037424 |
End bp | 3039136 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644974027 |
Product | protein of unknown function DUF181 |
Protein accession | YP_003131843 |
Protein GI | 257054010 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.108847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTTG GTCTCGTGGG TGCCGGTCCA GCCGCAGACG CGGTCGAATC GGCGCTCTCG GACGTCGAGG GCGCTGTGGA GACGGCGGCC CCCGACGAAA TCGACGAGTA CGAACTGGCT ATCGTGATCG AGGTCGCCGG TGCAAACGCC TTCGAGCAGG CAAACCAGCG GGCGATCCAG ACAGGCACGT CCTGGATCGC GATCGAACTC GGCGGCGTTG GCGGCGTCCC GGTCGTCGAT GCCGCGATCA CTGGCTTCGG CCCCGAAACG GCCTGCTACG AGTGTCTTCG CAAGCGGGTC AGGGCGAACG TCGATCCGAC CGAGGAGCCG ACGAAAGCGC CACCGCCAAC CACGGCACGG TTCGCCGGTG CGGTCGCCGG GCGGGCGGCC GCCAGTACAC TGGATCGGTC GGTCGACGGT CCCGAGGTGG TCGGATTCGT GAGGGAGATT CCGGACGCGA CCCGGCGATT GTTGTCCGTC CCTCACTGTG AGTGCGGTTC GGAACCGTCC CGTACTATCG ACCGGACGAC GGCCGACCGA GATCTGGAAG CCACCATCGG CCGGGCCGAA CAGGGACTCG ACGACCGGGT CGGGATCGTC CAGGAGGTCG GCGAGATCGA ATCCTTCCCC GCGCCCTACT ATCTCGCACG AAACGGCGAC ACTGAGGGGT TCAGCGACGT GTCATCGACC GGTCCGGCCG CGGGTGTCGA CGTCGACTGG AACGGCGCGC TGATGAAGGC ACTGGGGGAG GCCTACGAAC GGTACAGTGC GGGCGTCTAT CGAACGGATC GGCTACAGAC AGCGTCGGTC GAGGAGCTGG ACAATCCGAT CGCGCCGTCC GCGTTCGTGA CCCCCGAGCC ACCCGAATCG GACTCCATCG AGTGGATCGA GGGCGAGAAT CTGGCGACGA GCCAGTCAGT ACAGGTGCCG GCGTCCGTGG CGATGTATCC TCCCGCCGGG GATCGGTTCC GACCCGCGAA CACGACCGGG CTCGGGTTCG GTAACGCGAC AGTCGAGGCG TTGCTGTCGG GACTCTACGA GGTGATCGAG CGGGACGCGG CGATGCTCTC GTGGTATTCG ACGTTCGAGC CCCTGGCACT CGACGTCGGC GACGGGGCCT TCGAGACGCT TGCCCGGCGG GTGCGGACTG AGGGACTGTC GGTCACGCCG CTCCTTCTCA CACAGGACGT CGACGTTCCG GTCGTGGCTG TCGCTGTCCA CCGGGCGGAC GGCGACTGGC CCCGGTTCGC CCTGGGTTCG GGAGCACATC TTGATCCGGC TGCCGCCGCC CGATCGGCAC TAGCGGAGGC CGTCCAGAAC TGGTTCGAAC TCCGGCAGAT GGGGCCGGAC GGCGCAGCCG ATGCCGGCGG GGCGATCGGC GAGTACGCGG CCTTCCCCGA TATCGTCGCC GACTTCGTCG ACGCCGCAGG GACCGTCTCG GCCGACGCTG TCGCGCCCGA CGAGCCACTT GCGGGTTCCG AGGCCCTGTC GACTGTGGTC GATCGAGTCA CCGCGGCCGG GCTCTCGCCG TACGCCGTCC GGCTCACGCC GCGGGACGTC GAGGCGATGG GATTCGAGGC CGTTCGCGTG CTCGTCCCAT CGGCCCAGCC GCTGTTCCTC GGGGAGCCGT ACTTTGGCGA CCGACTGGAG ACGGTCTCGG ACCGCCTCGG CTTCGAGGCG CGACCTGACC GACCGTTCCA TCCGTTCCCC TGA
|
Protein sequence | MTVGLVGAGP AADAVESALS DVEGAVETAA PDEIDEYELA IVIEVAGANA FEQANQRAIQ TGTSWIAIEL GGVGGVPVVD AAITGFGPET ACYECLRKRV RANVDPTEEP TKAPPPTTAR FAGAVAGRAA ASTLDRSVDG PEVVGFVREI PDATRRLLSV PHCECGSEPS RTIDRTTADR DLEATIGRAE QGLDDRVGIV QEVGEIESFP APYYLARNGD TEGFSDVSST GPAAGVDVDW NGALMKALGE AYERYSAGVY RTDRLQTASV EELDNPIAPS AFVTPEPPES DSIEWIEGEN LATSQSVQVP ASVAMYPPAG DRFRPANTTG LGFGNATVEA LLSGLYEVIE RDAAMLSWYS TFEPLALDVG DGAFETLARR VRTEGLSVTP LLLTQDVDVP VVAVAVHRAD GDWPRFALGS GAHLDPAAAA RSALAEAVQN WFELRQMGPD GAADAGGAIG EYAAFPDIVA DFVDAAGTVS ADAVAPDEPL AGSEALSTVV DRVTAAGLSP YAVRLTPRDV EAMGFEAVRV LVPSAQPLFL GEPYFGDRLE TVSDRLGFEA RPDRPFHPFP
|
| |