Gene Hmuk_3293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3293 
Symbol 
ID8409371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013201 
Strand
Start bp93613 
End bp94830 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content56% 
IMG OID645018227 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003175748 
Protein GI257372974 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.944265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.342145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGATTC AGCGTACTGC TGTCGTCAAA CTTTCCGTCC CCGACCAGCG GCGCGACGAC 
CTGAAACGAA CGATGAACAC GTTTCGGGAC GCTGCACAGC GGTTTGCCGA CCGGGGATGG
GAAAGAGATG ACAATGGGTA CGTGATAACG TCGCGCTCTC GACTACAACC GCTCGTCTAC
GACGACATAC GGGACGACAC CGGCCTTCAC TCAGACTTGA CCGTGGCCGC CGTCAACCAT
GCTGCCGACG CGCTTACCGG CTGTGTAGAC AAAATGAAAG CTGGCGAACG AACCTCAAAA
CCTGTGTTCA CGTCGAACAC GACCGTCTAC AACACCAGTG CAATCAGTTA CTTCGACGGA
TACTGTTCGC TGGCCGCTTA CGGAAGTGGG CGTGTTCATG CTGCATACGT CTACCCAGAC
GACTCGCTCC AGGCCGAATA CATGGAGAGT AGCGAGTGGA CCAAACAAGG CGCGAAACTC
CGATACGACC ATCAGACCGA TACCTACTAC TTGCACGTTT CCGTCAAACA GGAACGCGAA
GATTCGTTGG AAGAGGCCGA GAGCCGAACA GTTCTCGGCG TAGACCGGAA CGTCGACGGG
TATCTTGCTG TCACCAGTAC AGGAGCATTC ATCGGCAACG CTGACCTACT GAACCACAAG
CGCCGCGAGT ATGAACGTCG TCGCGCCCGA CTACAACAAC AGGGGACGCG AAGCGCACAC
CTCACGATTC AGTCAATCGG TGACACCTTC GCTAACTGGT CCGAGGACTT TCTACACCAA
ACGTCGAAAC GACTGGTGAA AGAAGCCATG TCACGGGGCT GTTCGGTAAT CGTGTTCGAG
GACTTGGAAC AGATACGAGA ACGTATCTCG AACGCCTCGA AATTCCAGCA GTGGGCGTTC
CGCGAGTTGA AGCGCCAGAC GACATACAAA GCCCGTGCCG AAGGAATCGC TGTCGAATCA
GTCCATCCGG CCTACACCAG CCAGCGGTGT AGTCACGCCG ACTGTGGCTT CACTCACGAG
GACAACCGCG ACAGCGACCA GTTCACCTGC CAGAAGTGCG GCAAAGAGCT ACACGCCGAC
TACAACGCGG CACGCAACGT TGCACACAGA TTCATCCAGA ATCGGCTCAA GTCTGGTTCT
GGAGGGGCGA CCCATCACCT CGCCCTGAAG TCGGGGACAA TGAACGGGAA CGGCGACTAC
TCGCCTTCCA CAGTATAG
 
Protein sequence
MEIQRTAVVK LSVPDQRRDD LKRTMNTFRD AAQRFADRGW ERDDNGYVIT SRSRLQPLVY 
DDIRDDTGLH SDLTVAAVNH AADALTGCVD KMKAGERTSK PVFTSNTTVY NTSAISYFDG
YCSLAAYGSG RVHAAYVYPD DSLQAEYMES SEWTKQGAKL RYDHQTDTYY LHVSVKQERE
DSLEEAESRT VLGVDRNVDG YLAVTSTGAF IGNADLLNHK RREYERRRAR LQQQGTRSAH
LTIQSIGDTF ANWSEDFLHQ TSKRLVKEAM SRGCSVIVFE DLEQIRERIS NASKFQQWAF
RELKRQTTYK ARAEGIAVES VHPAYTSQRC SHADCGFTHE DNRDSDQFTC QKCGKELHAD
YNAARNVAHR FIQNRLKSGS GGATHHLALK SGTMNGNGDY SPSTV