Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_0628 |
Symbol | |
ID | 8410134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 587303 |
End bp | 589513 |
Gene Length | 2211 bp |
Protein Length | 736 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645018956 |
Product | transglutaminase domain protein |
Protein accession | YP_003176467 |
Protein GI | 257386694 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0435182 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACGG CAGGTCGGCT GGGGCTCGGC TGGCCCGCCG TCGACGGCGT CCCCGGACGG ACCCGGACTG CGGCGCTGGT CGCGGTCGCG GCGATGCTCC TCTCGACGCT GCAAGTGCTG TACTTCTTCA TCGACGTGGT CGGGCTCCCG TCGCGGTTTC TGGCGGTCGT CGCCGCCTCG CTGGTCGCCG CGACGGTGCT GGCCCGTCTC CTGCCCCCGC GCAGTGCCGT CGCGCTCGGC GGCGTCCTCC TGTTGGTCGG GCTCTGGTGG TACGTCCAGC AACTGAGCGG CTCGCCGCGG ATCGGGGAGT TGCTGACCGA CACGCTCTCC TTGCTGACTG GCAACACGCT CCTGCGGATC ACCAACATTC GGGCGTGGGT GCTCGGTGTC ACGCCCGCGC CCACCTTCCT GACGTGGTAC TTCGCGATGC GACGCCGCTA TGCCCTGTCC GTCGCCGCGG CGGGCGGGAC GCTGGGGTTC CTGGTCCTGA CCGGCGACGC CGACCTGCTG ACGACGCTGT CGGGCGTCGT CGCCGCGGTG GCGGCGCTTG GCTTTGGCGA CTTCGACCGC CGCGGCGAAC CGATCGTCGA CGCCGAGACG ATCCTCGTGA CGGCCGCGCT GATGGTCGTC GTCCCGTCGC TGGTGTCGGT CGTCCCCGCC ACTGCGGGGC TGTCGCTGAA CGTCGACGGC ACCGGCTCGG ACACGGTCGA GGCGAGTCTC CTCCAGAGCG GCGACCAGCT CTCGGTCCAG GGCTCGATCA GCCTCTCGCC CGAGGTCCGC TTTACCGTCA CCAGCAGCGA GCCTCGTTAC TGGCGGATCG GGAGCTTCGA CCGGTACACC GGTGACGGGT GGGTCAGTCA GACCAACAGC CGCGCGTACG GCGGTGACCG ACTCGACGAG CCGCCCGGCC CGACCCGGAT CGTCGAGCAG CGCTTCCGGG CCGAAACGGA CGCCGGGGTG ATGCCGGCGG CCTGGAAGCC GATCCAGAGC CGTGGCAACC CCGCCACACG CGTCGGCGGC GACGGGGGCC TCGCGACCGG CGCACCCATT CGAGAGGGCG ACAGCTATCG CGTCACCAGT GCCGTCCCCG CGGCCACGCC CGCCCAGCTC AACGACACCA CGCGTGCCTA CCCCGCGCGA GTCAACGAGA CGTACACGCA ACTGCCCGCG AGCACGCCCG ACCGCGTCGG CGAGCGGACC GAGCGGCTCA CACGCGACGC GCGTACGCCC TACGAGACGG CGCTGACGAT CGAAAACTGG CTGGAGAACA ACCGCGAGTA CTCGCTGGAC GTGCGCCGTC CCGACGGTAA CGTCGCCGAC GCCTTCCTCT TCGAGATGTC GGCCGGCTAC TGCACCTACT ACGCGACGAC GATGGCGACG ATGCTGCGGA CACAGGACAT CCCGGCGCGG ATGGCCGTCG GCTACACGTC CGGCGAACGG GTCGCCGAGG ATCGATGGGT CGTGCGCGGA CAGAACGCCC ACGCCTGGGT CGAGGTCTAC TTCGAAGAGT ACGGCTGGGT CCGGTTCGAC CCGACGCCGG CCAGCGACCG AGAGAGCGCC CGCGACCGGA ACGTCGAGTC GGCCCGCGAA CGAGACCGCC CGTCCGTCGA CACCAACGAG AGCGGCGGCC CGGAGTGGTC GCCGACGCCG ACGGCGACCC CACAGCCGCT CACGCCGGTC GACAACGATA CCGACGCCGG GGGAACGCCG CTCGGTCCCC AGAGTCGTCC CGGCGTCAAC CCGGAAGACA GCATCTCGAC CGCGACGCGG GTCGGCGAGT CCCCCACCGA CACTGTCGGT GCGACGACGG ACCGGCCCGG CCCGACCAGT TCGCTGCCGT CCCGTCGGGA GGCGGCACTG GGACTGCTCG CGATCGTCGG GACCGTGATC GGCGTGCGCC GGAGCGATCT GGGCCGGCGC GTCTACCGGG GCGTGTGGCT CTACTACCAG CCCCGGTCGA CCCCCGAGCG AGACGCCGAG CGGGCCTTTC AGCGGCTCGT GTACCATCTG GGGCGCGAAC ACGACCGGCC GCGCCGTGCC GAGGAGACCG TCCGAGCGTA TCTCGACGCC GTCGACGCCG ACGAGCGCGC ACGCGAGGTG GCATCGATCA GAGAACGGGC TCGCTACGCC GGCACGGTCG ACGAGGCGGC GGCCGATCGG GCCGTCTCGC TCGTCGACGA GATCGTTCGC TCGCGTGGCA CGGCGAAATA A
|
Protein sequence | MSTAGRLGLG WPAVDGVPGR TRTAALVAVA AMLLSTLQVL YFFIDVVGLP SRFLAVVAAS LVAATVLARL LPPRSAVALG GVLLLVGLWW YVQQLSGSPR IGELLTDTLS LLTGNTLLRI TNIRAWVLGV TPAPTFLTWY FAMRRRYALS VAAAGGTLGF LVLTGDADLL TTLSGVVAAV AALGFGDFDR RGEPIVDAET ILVTAALMVV VPSLVSVVPA TAGLSLNVDG TGSDTVEASL LQSGDQLSVQ GSISLSPEVR FTVTSSEPRY WRIGSFDRYT GDGWVSQTNS RAYGGDRLDE PPGPTRIVEQ RFRAETDAGV MPAAWKPIQS RGNPATRVGG DGGLATGAPI REGDSYRVTS AVPAATPAQL NDTTRAYPAR VNETYTQLPA STPDRVGERT ERLTRDARTP YETALTIENW LENNREYSLD VRRPDGNVAD AFLFEMSAGY CTYYATTMAT MLRTQDIPAR MAVGYTSGER VAEDRWVVRG QNAHAWVEVY FEEYGWVRFD PTPASDRESA RDRNVESARE RDRPSVDTNE SGGPEWSPTP TATPQPLTPV DNDTDAGGTP LGPQSRPGVN PEDSISTATR VGESPTDTVG ATTDRPGPTS SLPSRREAAL GLLAIVGTVI GVRRSDLGRR VYRGVWLYYQ PRSTPERDAE RAFQRLVYHL GREHDRPRRA EETVRAYLDA VDADERAREV ASIRERARYA GTVDEAAADR AVSLVDEIVR SRGTAK
|
| |