Gene Hmuk_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2044 
Symbol 
ID8411575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1945692 
End bp1946948 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content58% 
IMG OID645020378 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003177864 
Protein GI257388091 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGAGA CGACCCGCAC CTACGTCGTA CGCATCACGA ACCACGGTCA GGTTCGTGAC 
GACCTTGACC AGTGCGGGTT CGTAGCATCC AAGCTGTGGA ACGTCGGACG CCACTACATC
CAAGGTCGGT GGGATGAGGA CGGTGAAATA CCCGACGAGA AAGAGCTAAA ATCGGAGTTG
AAAGACCACG AACGCTACAG TGACCTGCAT TCTCAGTCAA GTCAGCGAGT TCTCGAAGAG
CTTGCTGAGG CGTTCACCGG CTGGTACAAC TCCGACGACG GCAATAACCC ACCCGGCTAT
CGGAAACGTG GTGACCGACA CCCGCGCTCC ACCGTGACGT GGAAGCAGAA AGCCATCAAG
CACGACGACA AGCACGGCCA GCTTCGTCTC TCGAAAGGCT TCAACCTGAA AGAGAGTCGA
TCTGACTTCA TCCTCGCGGA ATACGAAACC CGCCCTGACG TAGAAGTCGA GCACATCCAG
CAGGTGCGTG CCGTCTGGAA CGGCGACGAG TGGGAACTAC ACCTCGTCTG CAAGAAAGAA
ATTCCAATCG AGGACGCACC CGGCGACGCC ACGGCGGGTA TCGATCTCGG TATCAGCAAC
TACCTCGCTA TCGACTATGA GAACGGCCCC TCGGAGCTGT ATCCGGGGAA CGTGCTGAAA
GAGGACAAGC ACTACTTCAC CCGCGAGGAG TATCAGACCG AAGGCGAGAA CGGGCCGTCG
AAGCGTGCGC GGAAGGCTCG CCGGAAACTC TCCCGACGCA AAGACCACTT CCTCCACACT
CTCAGCAAGC ACATCGTTGA GCAGTGTGTC GAAGAAGGTG TGGGAAAGAT CGCGGTTGGC
GACCTCAGCG ACATTCGCGA AGGCGAGAAC GGTGATTCGC GGAATTGGGG TCCGTCGGGA
AACAAGAAGT TGCACGGATG GGAGTTCGGC CGCTTCGCCC GTCTGCTCGA ATACAAGGCC
GAGGAACACG GCATCCTCGT TGATCGTGTA GACGAGGAGA ACACCTCAAA GACGTGTTCG
TGTTGCGGAC AGATTCGTGC TAGCAACCGC GTGGAGCGTG GGCTGTACGT CTGTGAGTCG
TGCGAGACGA CGATGAATGC GGACGTGAAC GGTGCGGTGA ACATTCGGAG AAAGATAACT
CAGAGTCCCC CGACGGGGGA TATGAGTAAC GGCTGGTTGG CACAGCCCGG AGTCTTCCTG
TTCGACCGCG AGAGCGGACG GTTCCCACCG AGAGAACAGG GAGACTGTAG ACCCTAA
 
Protein sequence
MLETTRTYVV RITNHGQVRD DLDQCGFVAS KLWNVGRHYI QGRWDEDGEI PDEKELKSEL 
KDHERYSDLH SQSSQRVLEE LAEAFTGWYN SDDGNNPPGY RKRGDRHPRS TVTWKQKAIK
HDDKHGQLRL SKGFNLKESR SDFILAEYET RPDVEVEHIQ QVRAVWNGDE WELHLVCKKE
IPIEDAPGDA TAGIDLGISN YLAIDYENGP SELYPGNVLK EDKHYFTREE YQTEGENGPS
KRARKARRKL SRRKDHFLHT LSKHIVEQCV EEGVGKIAVG DLSDIREGEN GDSRNWGPSG
NKKLHGWEFG RFARLLEYKA EEHGILVDRV DEENTSKTCS CCGQIRASNR VERGLYVCES
CETTMNADVN GAVNIRRKIT QSPPTGDMSN GWLAQPGVFL FDRESGRFPP REQGDCRP