Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0760 |
Symbol | |
ID | 8410274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 728225 |
End bp | 729538 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645019095 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003176598 |
Protein GI | 257386825 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.332278 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA TCGCACTCTC GCTGGGGGGT ATCGCGCTCG CGCTCTTTCT GGTCCTGTTG AACGGCTTCT TCGTCGCCTC GGAGTTCGCC TTCGTCCGGA TCAGGGCGAC CTCCGTCCAG CAACTCGTCG AAGAGGGGGC CGCCGGTGCT GGCGTGCTGG ACGACGTGAT GGACAACCTC GACGACTACC TCGCGACGAC GCAACTGGGG ATCACGGTCG CGTCGCTGGG ACTGGGGTGG GTCGGTGAGC CCGCCATCGC CGACCTGCTG GAGCCGGTGC TGGCTCCGAT TCTGCCACCG AGTCTGCTCC ACGCCGTCGC CTTCGCCGTC GGGTTCACCG TCATCACCTT CCTCCACGTG GTCTTCGGAG AACTCGCGCC CAAGACGCTG GCTATCGCGG ACGCCGAGAA GATCTCGCTG CTCGTGGCCG CGCCGATGAA GTTCTTCTTC TATCTCCTCT ATCCGGGCAT CGTCGTGTTC AACGGCTCTG CGAACTTCTT CACGCAGTTG ATCGGCGTCG AGCCCGCTTC CGAGAGCGAG GAGACGCTCG AAGAAGAGGA GATTTTGCGG GTGCTGAACC AGTCCGGGCA GGCGGGCCAC GTCGACGCGG GCGAAGTCGA GATGATCCAG CGCGTCTTCG AGTTCGACGA CCGGTCCGTC CGCGAGGTGA TGGTCCCGCG ACCTGACGTG ATCAGTGTCA CGGCCTCCAC GCCGGTGACG GAACTGCGTT CGATCGTCCT CGACGCCGGG CACACGCGTT ATCCCGTGGT CGAGGGCGAC GACGGCGACC AGGTGGTCGG CTTCGTCGAC GCCAAGGACG TGCTTCGGGT GCTGGATGCC GGCGACGAGT CCCCGGCGAC CGCCGGGGAC ATCGCGCGGG ACCTCCCGAT GGTTCCGGAG TCGACCCGCA TCGACGACCT CCTGCGGGAG TTTCAGGACG AGCAGCGCCA GATGGCGATC GTGATCGACG AGTGGGGCGC CTTCGAGGGG ATCGCCACCG TCGAGGACGT CCTCGAGACG CTCGTGGGCG ACCTCCAGGA CGGCTTCGAC GCGGCGACGG GGGAACCCTC GATCGACGCG CGCGACGACG GTTCGTACCG CGTCGACGGT GCAGTCCCCC TCTCGACGGT CAACGACGAA CTGGACGCCA CCTTCGAGAG TCCCGCCTTC GAGACGATCG GTGGCCTCGT GCTGGATCGG CTGGGCCGGG CCCCGAAGGC CGGGGACACG GTCGAGACCG ACGGCTACCT GATCACCGTC GTGAGCGTCG ACGGCGCGCG CGTCTCGGTC GTCGACGTGG AACCGGCGAC GTGA
|
Protein sequence | MTDIALSLGG IALALFLVLL NGFFVASEFA FVRIRATSVQ QLVEEGAAGA GVLDDVMDNL DDYLATTQLG ITVASLGLGW VGEPAIADLL EPVLAPILPP SLLHAVAFAV GFTVITFLHV VFGELAPKTL AIADAEKISL LVAAPMKFFF YLLYPGIVVF NGSANFFTQL IGVEPASESE ETLEEEEILR VLNQSGQAGH VDAGEVEMIQ RVFEFDDRSV REVMVPRPDV ISVTASTPVT ELRSIVLDAG HTRYPVVEGD DGDQVVGFVD AKDVLRVLDA GDESPATAGD IARDLPMVPE STRIDDLLRE FQDEQRQMAI VIDEWGAFEG IATVEDVLET LVGDLQDGFD AATGEPSIDA RDDGSYRVDG AVPLSTVNDE LDATFESPAF ETIGGLVLDR LGRAPKAGDT VETDGYLITV VSVDGARVSV VDVEPAT
|
| |