Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1747 |
Symbol | |
ID | 8411271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1662031 |
End bp | 1663695 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645020075 |
Product | ASPIC/UnbV domain protein |
Protein accession | YP_003177568 |
Protein GI | 257387795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.302243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.511021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACC GACGGACGCT GGCGGTCACC GTCGTCGCAC TGCTGACCGT GACCGCCGGC TGCACTTCGA TCGCCGAGAT GGGGGGCGGC AGCGACACCG GCGGCGAGAT CGGCTTCGAG GACGTGACCG GAGAGGTGGG GCTGAACTAC AGCGACGACG TGGCGGGCGG TGCCGGCAAC GGCAACGCGG GCGTCTACGT CGCCGACGTG GACAACGACG GCTGGGACGA CGTGCTGGCG GTCGGCGGCC AACGGCCGGT GCTGTTCGAG AACACCGGCG GCGAGTTCAG TCGCTCGGGC GCGTTACCGG AGTACGACCG GCAGTTCAAG AGCGCCACGT TCGTCGACTA CGACGACGAC GGCTGGAAGG ACCTCCTGTT GTTGCCCCGG GGCGGTGGCG TCGTCGCGCT CCACAACGAC GAGGGATCGT TCGAACCGGA CGACGTGGGT CTGGAGAACG TCACACATCC CCTGGGAGCC GCGCCAGCCG ACTTCGACGG CGACGGCCGC GTCGACCTGT TCCTCTACCA GTCCGGTGAC TGGCTCGAAA AGACCCCGGC CGGCAAGAGC GCACTGAACG AGACGCTCAC GTCAGACAAC GGCTACCCTA ACTACCTCTA TCGCAACGTC GGCGGCGAGT TCGAGCGCGT CGAGGACGCC GGCATCGAGG GCGATCGCTG GAGTCTGGCC GCCACTGCAA CGGACCTGAC TGACGACGGC CGACCGGACA TCTACGTGGC GAACGACTTC AACACCGACG TGCTCTACGT CAACGAGGGC GGCGCCAGCT TCAGCCAGCG CCAGCTGGAG GGACCGACCG CGCGAAACGG GATGTCCGTC GAGGTCGCCG ACCTGGGCGG TGACCGGCGT CGGGACGTGT TCGTCACGAA CATCGAACTG CCCATCTCAC GGGACAACAT GTCCGAGGAA CGCTACGAGC TGCTCGAACG GATGTCGACG TTCGCCTTCC ACTCGGGCCG GACGAAGGGC AACACCGTGA TGGTCAATCA GGGCAACGGC ACGTTCGAGG ATCGAGCCAA GTCCCTGGGC GTCCAGACGG GTGGCTGGGG GTGGGCCGCG ACGGCGACGG ACTTCGACAA CGACGGGGAC CGCGACCTCA TGCACGCGAC CCAGAACGTG TTCCGGCTGG ACCCCGACGA TCCACGCTAC ACCTTCCCCA TGCTCTGGGA GCGGACCGAC GGGGGCTTCG AGAACCTCGC AGCCGACGAG CGGGGGCTCC CCGAGACCGA CGGACGTGGT CTGGTCGCAC TCGACTTCGA CAACGACGGC GATCAGGACG TGATATCGGC CGCCTACGGC GACCGATTCG CCGTCTACGA GAACACGGTC ACCAGCGACA CGGATCGCCT GCAGTTCCGG GCCGTCGACG AGACCGGCGG GACGGCACTG GGCGCGACCG TGACCGTGAG CGGCGAGGAC AGCACCCAGC GGGTCGTCCA GTACTCGGAG ACGGACTACC TCTCACAGGA GAGTCAGGTC GGCCACGTCG GCCTCGCGGG CGAGGGCAAC GTGACGATCA CGGTCACCTG GCCCGACGGG ACCGAACGGA CCTTCGACGA CGTGGCACCG AACCAGCGGG TCCGCCTCTC GCCCGACGGC ATCGAGACGG TGACGACGTT CCCGACCGGC GGTCAGACCG GGTGA
|
Protein sequence | MIDRRTLAVT VVALLTVTAG CTSIAEMGGG SDTGGEIGFE DVTGEVGLNY SDDVAGGAGN GNAGVYVADV DNDGWDDVLA VGGQRPVLFE NTGGEFSRSG ALPEYDRQFK SATFVDYDDD GWKDLLLLPR GGGVVALHND EGSFEPDDVG LENVTHPLGA APADFDGDGR VDLFLYQSGD WLEKTPAGKS ALNETLTSDN GYPNYLYRNV GGEFERVEDA GIEGDRWSLA ATATDLTDDG RPDIYVANDF NTDVLYVNEG GASFSQRQLE GPTARNGMSV EVADLGGDRR RDVFVTNIEL PISRDNMSEE RYELLERMST FAFHSGRTKG NTVMVNQGNG TFEDRAKSLG VQTGGWGWAA TATDFDNDGD RDLMHATQNV FRLDPDDPRY TFPMLWERTD GGFENLAADE RGLPETDGRG LVALDFDNDG DQDVISAAYG DRFAVYENTV TSDTDRLQFR AVDETGGTAL GATVTVSGED STQRVVQYSE TDYLSQESQV GHVGLAGEGN VTITVTWPDG TERTFDDVAP NQRVRLSPDG IETVTTFPTG GQTG
|
| |