Gene Hmuk_1747 details

Gene Information

Locus tag: Hmuk_1747
Symbol:
ID: 8411271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1662031
End bp: 1663695
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 68%
IMG OID: 645020075
Product: ASPIC/UnbV domain protein
Protein accession: YP_003177568
Protein GI: 257387795
COG category:
COG ID:
TIGRFAM ID:
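
The coordinate and length fields in this record are mutually consistent if the coordinates are 1-based and inclusive and the protein length excludes the stop codon. The record does not state either convention, so both are assumptions in the minimal Python check below, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp copied from the record.
length = gene_length(1662031, 1663695)
print(length, protein_length(length))  # prints: 1665 554
```

Both results match the Gene Length and Protein Length fields reported in the record.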


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.302243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.511021
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACC GACGGACGCT GGCGGTCACC GTCGTCGCAC TGCTGACCGT GACCGCCGGC 
TGCACTTCGA TCGCCGAGAT GGGGGGCGGC AGCGACACCG GCGGCGAGAT CGGCTTCGAG
GACGTGACCG GAGAGGTGGG GCTGAACTAC AGCGACGACG TGGCGGGCGG TGCCGGCAAC
GGCAACGCGG GCGTCTACGT CGCCGACGTG GACAACGACG GCTGGGACGA CGTGCTGGCG
GTCGGCGGCC AACGGCCGGT GCTGTTCGAG AACACCGGCG GCGAGTTCAG TCGCTCGGGC
GCGTTACCGG AGTACGACCG GCAGTTCAAG AGCGCCACGT TCGTCGACTA CGACGACGAC
GGCTGGAAGG ACCTCCTGTT GTTGCCCCGG GGCGGTGGCG TCGTCGCGCT CCACAACGAC
GAGGGATCGT TCGAACCGGA CGACGTGGGT CTGGAGAACG TCACACATCC CCTGGGAGCC
GCGCCAGCCG ACTTCGACGG CGACGGCCGC GTCGACCTGT TCCTCTACCA GTCCGGTGAC
TGGCTCGAAA AGACCCCGGC CGGCAAGAGC GCACTGAACG AGACGCTCAC GTCAGACAAC
GGCTACCCTA ACTACCTCTA TCGCAACGTC GGCGGCGAGT TCGAGCGCGT CGAGGACGCC
GGCATCGAGG GCGATCGCTG GAGTCTGGCC GCCACTGCAA CGGACCTGAC TGACGACGGC
CGACCGGACA TCTACGTGGC GAACGACTTC AACACCGACG TGCTCTACGT CAACGAGGGC
GGCGCCAGCT TCAGCCAGCG CCAGCTGGAG GGACCGACCG CGCGAAACGG GATGTCCGTC
GAGGTCGCCG ACCTGGGCGG TGACCGGCGT CGGGACGTGT TCGTCACGAA CATCGAACTG
CCCATCTCAC GGGACAACAT GTCCGAGGAA CGCTACGAGC TGCTCGAACG GATGTCGACG
TTCGCCTTCC ACTCGGGCCG GACGAAGGGC AACACCGTGA TGGTCAATCA GGGCAACGGC
ACGTTCGAGG ATCGAGCCAA GTCCCTGGGC GTCCAGACGG GTGGCTGGGG GTGGGCCGCG
ACGGCGACGG ACTTCGACAA CGACGGGGAC CGCGACCTCA TGCACGCGAC CCAGAACGTG
TTCCGGCTGG ACCCCGACGA TCCACGCTAC ACCTTCCCCA TGCTCTGGGA GCGGACCGAC
GGGGGCTTCG AGAACCTCGC AGCCGACGAG CGGGGGCTCC CCGAGACCGA CGGACGTGGT
CTGGTCGCAC TCGACTTCGA CAACGACGGC GATCAGGACG TGATATCGGC CGCCTACGGC
GACCGATTCG CCGTCTACGA GAACACGGTC ACCAGCGACA CGGATCGCCT GCAGTTCCGG
GCCGTCGACG AGACCGGCGG GACGGCACTG GGCGCGACCG TGACCGTGAG CGGCGAGGAC
AGCACCCAGC GGGTCGTCCA GTACTCGGAG ACGGACTACC TCTCACAGGA GAGTCAGGTC
GGCCACGTCG GCCTCGCGGG CGAGGGCAAC GTGACGATCA CGGTCACCTG GCCCGACGGG
ACCGAACGGA CCTTCGACGA CGTGGCACCG AACCAGCGGG TCCGCCTCTC GCCCGACGGC
ATCGAGACGG TGACGACGTT CCCGACCGGC GGTCAGACCG GGTGA
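
A GC content value such as the one in the Gene Information section can be recomputed from a sequence in the 10-base block layout used here. The following is a minimal sketch; `gc_percent` is a hypothetical helper, and how the database rounds its reported percentage is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Short demo input: the first two 10-base blocks of the gene sequence.
print(gc_percent("ATGATCGACC GACGGACGCT"))
```

Passing the full gene sequence, with or without the spaces and line breaks, gives the figure for the whole gene.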
 
Protein sequence
MIDRRTLAVT VVALLTVTAG CTSIAEMGGG SDTGGEIGFE DVTGEVGLNY SDDVAGGAGN 
GNAGVYVADV DNDGWDDVLA VGGQRPVLFE NTGGEFSRSG ALPEYDRQFK SATFVDYDDD
GWKDLLLLPR GGGVVALHND EGSFEPDDVG LENVTHPLGA APADFDGDGR VDLFLYQSGD
WLEKTPAGKS ALNETLTSDN GYPNYLYRNV GGEFERVEDA GIEGDRWSLA ATATDLTDDG
RPDIYVANDF NTDVLYVNEG GASFSQRQLE GPTARNGMSV EVADLGGDRR RDVFVTNIEL
PISRDNMSEE RYELLERMST FAFHSGRTKG NTVMVNQGNG TFEDRAKSLG VQTGGWGWAA
TATDFDNDGD RDLMHATQNV FRLDPDDPRY TFPMLWERTD GGFENLAADE RGLPETDGRG
LVALDFDNDG DQDVISAAYG DRFAVYENTV TSDTDRLQFR AVDETGGTAL GATVTVSGED
STQRVVQYSE TDYLSQESQV GHVGLAGEGN VTITVTWPDG TERTFDDVAP NQRVRLSPDG
IETVTTFPTG GQTG
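
The protein sequence can be reproduced from the gene sequence using translation table 11, which has the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons. The sketch below assumes the input is a complete forward-strand CDS; `translate` and the table constants are illustrative names, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS up to the first stop codon.

    The first codon is always emitted as M, on the assumption that it is
    the initiation codon of a complete CDS.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATCGACC GACGGACGCT GGCGGTCACC"))  # prints: MIDRRTLAVT
```

The output matches the first ten residues of the protein sequence listed above.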