Gene Hmuk_0628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0628 
Symbol 
ID8410134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp587303 
End bp589513 
Gene Length2211 bp 
Protein Length736 aa 
Translation table11 
GC content71% 
IMG OID645018956 
Producttransglutaminase domain protein 
Protein accessionYP_003176467 
Protein GI257386694 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0435182 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACGG CAGGTCGGCT GGGGCTCGGC TGGCCCGCCG TCGACGGCGT CCCCGGACGG 
ACCCGGACTG CGGCGCTGGT CGCGGTCGCG GCGATGCTCC TCTCGACGCT GCAAGTGCTG
TACTTCTTCA TCGACGTGGT CGGGCTCCCG TCGCGGTTTC TGGCGGTCGT CGCCGCCTCG
CTGGTCGCCG CGACGGTGCT GGCCCGTCTC CTGCCCCCGC GCAGTGCCGT CGCGCTCGGC
GGCGTCCTCC TGTTGGTCGG GCTCTGGTGG TACGTCCAGC AACTGAGCGG CTCGCCGCGG
ATCGGGGAGT TGCTGACCGA CACGCTCTCC TTGCTGACTG GCAACACGCT CCTGCGGATC
ACCAACATTC GGGCGTGGGT GCTCGGTGTC ACGCCCGCGC CCACCTTCCT GACGTGGTAC
TTCGCGATGC GACGCCGCTA TGCCCTGTCC GTCGCCGCGG CGGGCGGGAC GCTGGGGTTC
CTGGTCCTGA CCGGCGACGC CGACCTGCTG ACGACGCTGT CGGGCGTCGT CGCCGCGGTG
GCGGCGCTTG GCTTTGGCGA CTTCGACCGC CGCGGCGAAC CGATCGTCGA CGCCGAGACG
ATCCTCGTGA CGGCCGCGCT GATGGTCGTC GTCCCGTCGC TGGTGTCGGT CGTCCCCGCC
ACTGCGGGGC TGTCGCTGAA CGTCGACGGC ACCGGCTCGG ACACGGTCGA GGCGAGTCTC
CTCCAGAGCG GCGACCAGCT CTCGGTCCAG GGCTCGATCA GCCTCTCGCC CGAGGTCCGC
TTTACCGTCA CCAGCAGCGA GCCTCGTTAC TGGCGGATCG GGAGCTTCGA CCGGTACACC
GGTGACGGGT GGGTCAGTCA GACCAACAGC CGCGCGTACG GCGGTGACCG ACTCGACGAG
CCGCCCGGCC CGACCCGGAT CGTCGAGCAG CGCTTCCGGG CCGAAACGGA CGCCGGGGTG
ATGCCGGCGG CCTGGAAGCC GATCCAGAGC CGTGGCAACC CCGCCACACG CGTCGGCGGC
GACGGGGGCC TCGCGACCGG CGCACCCATT CGAGAGGGCG ACAGCTATCG CGTCACCAGT
GCCGTCCCCG CGGCCACGCC CGCCCAGCTC AACGACACCA CGCGTGCCTA CCCCGCGCGA
GTCAACGAGA CGTACACGCA ACTGCCCGCG AGCACGCCCG ACCGCGTCGG CGAGCGGACC
GAGCGGCTCA CACGCGACGC GCGTACGCCC TACGAGACGG CGCTGACGAT CGAAAACTGG
CTGGAGAACA ACCGCGAGTA CTCGCTGGAC GTGCGCCGTC CCGACGGTAA CGTCGCCGAC
GCCTTCCTCT TCGAGATGTC GGCCGGCTAC TGCACCTACT ACGCGACGAC GATGGCGACG
ATGCTGCGGA CACAGGACAT CCCGGCGCGG ATGGCCGTCG GCTACACGTC CGGCGAACGG
GTCGCCGAGG ATCGATGGGT CGTGCGCGGA CAGAACGCCC ACGCCTGGGT CGAGGTCTAC
TTCGAAGAGT ACGGCTGGGT CCGGTTCGAC CCGACGCCGG CCAGCGACCG AGAGAGCGCC
CGCGACCGGA ACGTCGAGTC GGCCCGCGAA CGAGACCGCC CGTCCGTCGA CACCAACGAG
AGCGGCGGCC CGGAGTGGTC GCCGACGCCG ACGGCGACCC CACAGCCGCT CACGCCGGTC
GACAACGATA CCGACGCCGG GGGAACGCCG CTCGGTCCCC AGAGTCGTCC CGGCGTCAAC
CCGGAAGACA GCATCTCGAC CGCGACGCGG GTCGGCGAGT CCCCCACCGA CACTGTCGGT
GCGACGACGG ACCGGCCCGG CCCGACCAGT TCGCTGCCGT CCCGTCGGGA GGCGGCACTG
GGACTGCTCG CGATCGTCGG GACCGTGATC GGCGTGCGCC GGAGCGATCT GGGCCGGCGC
GTCTACCGGG GCGTGTGGCT CTACTACCAG CCCCGGTCGA CCCCCGAGCG AGACGCCGAG
CGGGCCTTTC AGCGGCTCGT GTACCATCTG GGGCGCGAAC ACGACCGGCC GCGCCGTGCC
GAGGAGACCG TCCGAGCGTA TCTCGACGCC GTCGACGCCG ACGAGCGCGC ACGCGAGGTG
GCATCGATCA GAGAACGGGC TCGCTACGCC GGCACGGTCG ACGAGGCGGC GGCCGATCGG
GCCGTCTCGC TCGTCGACGA GATCGTTCGC TCGCGTGGCA CGGCGAAATA A
 
Protein sequence
MSTAGRLGLG WPAVDGVPGR TRTAALVAVA AMLLSTLQVL YFFIDVVGLP SRFLAVVAAS 
LVAATVLARL LPPRSAVALG GVLLLVGLWW YVQQLSGSPR IGELLTDTLS LLTGNTLLRI
TNIRAWVLGV TPAPTFLTWY FAMRRRYALS VAAAGGTLGF LVLTGDADLL TTLSGVVAAV
AALGFGDFDR RGEPIVDAET ILVTAALMVV VPSLVSVVPA TAGLSLNVDG TGSDTVEASL
LQSGDQLSVQ GSISLSPEVR FTVTSSEPRY WRIGSFDRYT GDGWVSQTNS RAYGGDRLDE
PPGPTRIVEQ RFRAETDAGV MPAAWKPIQS RGNPATRVGG DGGLATGAPI REGDSYRVTS
AVPAATPAQL NDTTRAYPAR VNETYTQLPA STPDRVGERT ERLTRDARTP YETALTIENW
LENNREYSLD VRRPDGNVAD AFLFEMSAGY CTYYATTMAT MLRTQDIPAR MAVGYTSGER
VAEDRWVVRG QNAHAWVEVY FEEYGWVRFD PTPASDRESA RDRNVESARE RDRPSVDTNE
SGGPEWSPTP TATPQPLTPV DNDTDAGGTP LGPQSRPGVN PEDSISTATR VGESPTDTVG
ATTDRPGPTS SLPSRREAAL GLLAIVGTVI GVRRSDLGRR VYRGVWLYYQ PRSTPERDAE
RAFQRLVYHL GREHDRPRRA EETVRAYLDA VDADERAREV ASIRERARYA GTVDEAAADR
AVSLVDEIVR SRGTAK