Gene Information |
Locus tag | Hmuk_2044 |
Symbol | |
ID | 8411575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1945692 |
End bp | 1946948 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 645020378 |
Product | transposase, IS605 OrfB family |
Protein accession | YP_003177864 |
Protein GI | 257388091 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
| |
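The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1945692, 1946948)
print(length, protein_length(length))  # prints: 1257 418
```

Both values agree with the Gene Length (1257 bp) and Protein Length (418 aa) fields in the record.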
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGAGA CGACCCGCAC CTACGTCGTA CGCATCACGA ACCACGGTCA GGTTCGTGAC GACCTTGACC AGTGCGGGTT CGTAGCATCC AAGCTGTGGA ACGTCGGACG CCACTACATC CAAGGTCGGT GGGATGAGGA CGGTGAAATA CCCGACGAGA AAGAGCTAAA ATCGGAGTTG AAAGACCACG AACGCTACAG TGACCTGCAT TCTCAGTCAA GTCAGCGAGT TCTCGAAGAG CTTGCTGAGG CGTTCACCGG CTGGTACAAC TCCGACGACG GCAATAACCC ACCCGGCTAT CGGAAACGTG GTGACCGACA CCCGCGCTCC ACCGTGACGT GGAAGCAGAA AGCCATCAAG CACGACGACA AGCACGGCCA GCTTCGTCTC TCGAAAGGCT TCAACCTGAA AGAGAGTCGA TCTGACTTCA TCCTCGCGGA ATACGAAACC CGCCCTGACG TAGAAGTCGA GCACATCCAG CAGGTGCGTG CCGTCTGGAA CGGCGACGAG TGGGAACTAC ACCTCGTCTG CAAGAAAGAA ATTCCAATCG AGGACGCACC CGGCGACGCC ACGGCGGGTA TCGATCTCGG TATCAGCAAC TACCTCGCTA TCGACTATGA GAACGGCCCC TCGGAGCTGT ATCCGGGGAA CGTGCTGAAA GAGGACAAGC ACTACTTCAC CCGCGAGGAG TATCAGACCG AAGGCGAGAA CGGGCCGTCG AAGCGTGCGC GGAAGGCTCG CCGGAAACTC TCCCGACGCA AAGACCACTT CCTCCACACT CTCAGCAAGC ACATCGTTGA GCAGTGTGTC GAAGAAGGTG TGGGAAAGAT CGCGGTTGGC GACCTCAGCG ACATTCGCGA AGGCGAGAAC GGTGATTCGC GGAATTGGGG TCCGTCGGGA AACAAGAAGT TGCACGGATG GGAGTTCGGC CGCTTCGCCC GTCTGCTCGA ATACAAGGCC GAGGAACACG GCATCCTCGT TGATCGTGTA GACGAGGAGA ACACCTCAAA GACGTGTTCG TGTTGCGGAC AGATTCGTGC TAGCAACCGC GTGGAGCGTG GGCTGTACGT CTGTGAGTCG TGCGAGACGA CGATGAATGC GGACGTGAAC GGTGCGGTGA ACATTCGGAG AAAGATAACT CAGAGTCCCC CGACGGGGGA TATGAGTAAC GGCTGGTTGG CACAGCCCGG AGTCTTCCTG TTCGACCGCG AGAGCGGACG GTTCCCACCG AGAGAACAGG GAGACTGTAG ACCCTAA
Protein sequence | MLETTRTYVV RITNHGQVRD DLDQCGFVAS KLWNVGRHYI QGRWDEDGEI PDEKELKSEL KDHERYSDLH SQSSQRVLEE LAEAFTGWYN SDDGNNPPGY RKRGDRHPRS TVTWKQKAIK HDDKHGQLRL SKGFNLKESR SDFILAEYET RPDVEVEHIQ QVRAVWNGDE WELHLVCKKE IPIEDAPGDA TAGIDLGISN YLAIDYENGP SELYPGNVLK EDKHYFTREE YQTEGENGPS KRARKARRKL SRRKDHFLHT LSKHIVEQCV EEGVGKIAVG DLSDIREGEN GDSRNWGPSG NKKLHGWEFG RFARLLEYKA EEHGILVDRV DEENTSKTCS CCGQIRASNR VERGLYVCES CETTMNADVN GAVNIRRKIT QSPPTGDMSN GWLAQPGVFL FDRESGRFPP REQGDCRP
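The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the GC content and the protein sequence can be derived from the gene sequence, the helpers below strip the whitespace, count G and C bases, and translate codon by codon. The helper names are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns sense codons the same way and differs only in its permitted start codons.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGCTGGAGA CGACCCGCAC CTACGTCGTA"
print(translate(prefix))  # prints: MLETTRTYVV
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` should allow the GC content and Protein sequence fields to be checked in the same way.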