Gene Hmuk_1671 details

Gene Information

Locus tag: Hmuk_1671
Symbol:
ID: 8411194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1599700
End bp: 1600917
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 57%
IMG OID: 645019998
Product: transposase, IS605 OrfB family
Protein accession: YP_003177492
Protein GI: 257387719
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
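
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which a short Python sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; both are assumptions about this database's conventions rather than documented facts, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon that is
    counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(1599700, 1600917)
print(gene_len, protein_len)  # 1218 405
```

Under these assumptions the 1218 bp span holds 406 codons, and dropping the stop codon gives the listed 405 aa.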


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGATTC AGCGTACTGC TGTCGTCAAA CTTTCCGTCC CCGACCAGCG GCGCGACGAC 
CTGAAACGAA CGATGAACAC GTTTCGGGAC GCTGCACAGC GGTTTGCCAA CCGGGGATGG
GAAAGAGATG ACAATGGGTA CGTGATAACG TCGCGCTCTC GACTACAACC GCTCGTCTAC
GACGACATAC GGGACGACAC CGGCCTTCAC TCAGACTTGA CCGTGGCCGC CGTCAACCAT
GCCGCCGACG CGCTTACCGG CTGTGTAGAC AAAATGAAAG CTGGCGAACG CCCCTCAAAA
CCTGTGTTCA CGTCGAACAC GACCGTCTAC AACACCAGTG CAATCAGTTA CTTCGACGGA
TACTGTTCGC TGGCCGCTTA CGGAAGTGGG CGTGTTCATG CTGAATACGT CTACCCAGAC
GACTCGCTCC AGGCCGAATA CATGGAGAGT AGCGAGTGGA CCAAACAAGG CGCGAAACTC
CGATACGACC ATCAGACCGA TACCTACTAC TTGCACGTTT CCGTCAAACA GGAACGCGAA
GATTCGTTGG AAGAGGCCGA GAGCCGAACA GTTCTCGGCG TAGACCGGAA CGTCGACGGG
TATCTTGCTG TCACCAGTAC AGGAGCGTTC ATCGGCAACG CTGACCTACT GAACCACAAG
CGCCGCGAGT ATGAACGTCG TCGCGCCCGA CTACAACAAC AGGGGACGCG AAGCGCACAC
CTCACGATTC AGTCAATCGG TGACACCTTC GCTAACTGGT CCGAGGACGT TCTACACCAA
ACGTCGAAAC GACTGGTGAA AGAAGCCATG TCACGGGGCT GTTCGGCAAT CGTGTTCGAG
GACTTGGAAC AGATACGAGA ACGTATCTCG AACGCCTCGA AATTCCAGCA GTGGGCGTTC
CGCGAGTTGA AGCGCCAGAC GACATACAAA GCCCGTGCCG AAGGAATCGC TGTCGAATCA
GTCCATCCGG CCTACACCAG CCAGCGGTGT AGTCACGCCG ACTGTGGCTT CACCCACGAG
GACAACCGCG ACGGCGACCA GTTCACCTGC CAGAAATGCG GGAAAGAACT TCATAGCGAC
TACAACGCGG CGCGCAACAT CGCACACAGA TTCATCCAGA ACCGGCTCAA GTCTGGTTCT
GGAGGGGCGA CCCATCACCT CGCCCTGAAG TCGGGAACAG TGAACGGGAA CGGCGACTAC
TCGCCTTCCA CAGTATAG
 
Protein sequence
MEIQRTAVVK LSVPDQRRDD LKRTMNTFRD AAQRFANRGW ERDDNGYVIT SRSRLQPLVY 
DDIRDDTGLH SDLTVAAVNH AADALTGCVD KMKAGERPSK PVFTSNTTVY NTSAISYFDG
YCSLAAYGSG RVHAEYVYPD DSLQAEYMES SEWTKQGAKL RYDHQTDTYY LHVSVKQERE
DSLEEAESRT VLGVDRNVDG YLAVTSTGAF IGNADLLNHK RREYERRRAR LQQQGTRSAH
LTIQSIGDTF ANWSEDVLHQ TSKRLVKEAM SRGCSAIVFE DLEQIRERIS NASKFQQWAF
RELKRQTTYK ARAEGIAVES VHPAYTSQRC SHADCGFTHE DNRDGDQFTC QKCGKELHSD
YNAARNIAHR FIQNRLKSGS GGATHHLALK SGTVNGNGDY SPSTV
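
The relationship between the two sequences can be checked with a minimal Python sketch that strips the block spacing and translates the gene codon by codon. It uses the standard codon assignments, which translation table 11 shares (table 11 differs only in which codons may serve as start codons, and this gene begins with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence listed above.
print(translate("ATGGAGATTC AGCGTACTGC TGTCGTCAAA"))  # MEIQRTAVVK
print(translate("TCGCCTTCCA CAGTATAG"))               # SPSTV
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison against the listed protein sequence and GC content.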