Gene Hmuk_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1946 
Symbol 
ID8411474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1856283 
End bp1857539 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content60% 
IMG OID645020277 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003177766 
Protein GI257387993 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.501136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACGA CGACTAAGAC GCTCGAAGCC ACGCTTGTCC CGCCGACAGC ACACAAAGAG 
CGGAAACTGT GTGACCTGCT CGAAACCTAC CGGGAGGGGC TTCACGAGGC GTTCGACGCC
AGGTGTGACA CGATGAGCGC AACCAGCGAT GTGGTGACGC CTTACGACCT GCCGTATCAG
GCGAAAGCGG CGCTGTGCAA CTACGTCCCG CAACTTCACA ACACCTACGA CGCTCAAGAG
TTAGACGATG ACCACCCGGT TCGGCTCACC AACCAAGCCG CCGAGTTTGA CCACTCTCCG
GCGCGTGACT ACGAGTTTAC GTGGTGGGCA CCACAACCCG GTCGTGGGAC GAATTTCTGG
ATACCGCTTC GTATCAATCC CGAACAGGAG GATCTGTGGC ACGACCTCGT AGATGGGAAC
GCTTCGGCAG GCCAACTCCG CCTGCAACGG AACCGCACAT CGTGGACGTT ACACGTCACT
GTCGAGTTTC CGGTCGAAGA ACCCGACTAC GCGACGGACG GCGACGACGT GACGCACATC
GGTCTGGATA TTGGTGAAAC TGCCCTGATA ACGGGCTGTG CCCTCAAGGA CGGTTCACCA
ACTGGCCCGT TCGTGTGTGA CGGGAGCCGT GCGAAGCATC TCCGCAAAGA GATGCACACC
ACCCTGAAAC GACTCCAAGA GCGAGACGCC GTCGAGTGGC GGATTGACGA GCGATTCAAC
CACTACCAGA ACGCGCTTAC CGATATTGTC GAGAAGGCGT CTCGGCAGGC CGTCGAGTAC
GCCCGGCAAT TCGAGAAGCC GGTGCTGGTA ATGGAGAACC TGACGTACAT CCGCGAAGAA
TTGGACTACG GTTCGTACAT GAACCGGCGA CTCCATGCGT GGGCGTTCGC TCGATTACAG
AACCGCGTCG AGGACAAATC GAAAGAGGCC GGTATCCCGG TCGAATACGT CCGACCGGAG
TACACCAGCC AGACGTGCCA CGCCTGCGGC CACATCGGAA ACAGAGCCGC GCAAGCCACG
TTCCGGTGTA CCAACGACGA GTGTCACATC ACGGAGTTTC AGGGCGATAT AAACGGCGCA
ATCAACGTTG CACAACGGGC TGACCCGTGG GGAGAGAGCG TGCCGCTGAA ACCGGCAGGC
AATGACTCGC CTCGGGATGG GAGTGCCTGT GACAGCACCA CGACCCACAC CAAGCAGAGC
CAACCACGGC AGATGACGCT TAGCGAGGTC GGGTCGGAAC CCACTGCCGG TAGTTGA
 
Protein sequence
METTTKTLEA TLVPPTAHKE RKLCDLLETY REGLHEAFDA RCDTMSATSD VVTPYDLPYQ 
AKAALCNYVP QLHNTYDAQE LDDDHPVRLT NQAAEFDHSP ARDYEFTWWA PQPGRGTNFW
IPLRINPEQE DLWHDLVDGN ASAGQLRLQR NRTSWTLHVT VEFPVEEPDY ATDGDDVTHI
GLDIGETALI TGCALKDGSP TGPFVCDGSR AKHLRKEMHT TLKRLQERDA VEWRIDERFN
HYQNALTDIV EKASRQAVEY ARQFEKPVLV MENLTYIREE LDYGSYMNRR LHAWAFARLQ
NRVEDKSKEA GIPVEYVRPE YTSQTCHACG HIGNRAAQAT FRCTNDECHI TEFQGDINGA
INVAQRADPW GESVPLKPAG NDSPRDGSAC DSTTTHTKQS QPRQMTLSEV GSEPTAGS