Gene Hmuk_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0783 
Symbol 
ID8410297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp753324 
End bp756548 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content63% 
IMG OID645019118 
ProductLanthionine synthetase C family protein 
Protein accessionYP_003176621 
Protein GI257386848 
COG category[V] Defense mechanisms 
COG ID[COG4403] Lantibiotic modifying enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.265394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.258057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCTG TCTTCACTGA CGCTGAAAAA CGATCGATCG TCGGACAGGC CCGCACACTG 
CACGAACGAG TACAGACGCT CGACGAGTAC GACCAAGACG TCGAACACGA CGAACGAATC
GAGGCACTGT GGGACGAATG GCGGTCGCAG TTTCCCAGCG ACGGGAGCTT CGAGGATCGA
ATCGAGTGGT CGGACGTTTC CGAGTCCGAG TGGCGGCGAG CCATCGAGGT CGAAACGCTG
AACGAAGGCG CGTCAGTCCC GGCGTGGGTC GACCGCCTGG AAGCGGCGGT CGCAGCGATA
CAGGACCGCT CGCCCGAACG TGCAGACACG CGCTTCGACG TCGACACGGA CGAACGACGG
TTGCTCGGAG AACTGACCGC GGCACTGGCT GATTACGTGT GCGATCGCGT CCCAGCTGAG
GTTGAGGAGG CGCTCACCGA CCAAGCTATC GCCGAGATGG GTGAGTGGTT CAGAACTCGG
TTCCAGAACC GTTTTTCCCG AATTCTGTTC GTGGAGTTCA AGTCGTTCGT GGCAGCACAC
GACCGTGAAC TCGCATTCGC GGACCCGAAC GATTTCGACG AACCGCCGAG CGAATACTAC
GACCGATTCG TCGCGTACCT GTTCGACGGT GGGTTCGTCG ACCTCTGTCA GGAGTATCCG
GTGTTCGCTC GGCTCCTCGT GACACAGATT CGGCAGTGGC AGCGTCACCT CGACGTGTTC
TGTGAACGGC TCCGTTCGGA CCGGGACCTG CTGTGTGACC GATTCGGAAG CGGAGACGAT
CTCGGCCCGG TCGTGAGCGT CAAACCGCTC GCCGACGACA CGCATGGCGA CGGACGTGCT
GTCATGCGTG TTACGTTCGA CGCGGGGCTC AGTGTGGTGT ACAAGCCCCG GAGCGTGGCT
GCCGGCGAAG CGCTGTACGA CACGCTCGAG GCGATCGATG AGCACCTGTC CTGTCCGTCG
TTCGACACTC CGACGTATCT CGACCGCGAT GCCTACGGGT GGATGGAATG GATCGACCAC
GAGCCGTGCC AGGACGACAG TGAGGTCGAG CGGTACTACC GTCGAGCTGG TGTGTGGCTC
TGTCTGGCGC ATCTCTTCGA GTTTTCCGAC TGCCACTTCG AGAACGTGAA AGCGGCTGGC
GACCAGCCAC TGCTGGTCGA CTCCGAGACC GTCTTCCATC CTTACTTCGA CGCGGAGCGG
CGACCTGGGA GCGGCGATAT CGGGACCCTC ACGGACGATA GCACGCTCTT GACTTCTTTG
CTGCCGTACG ACGTCACATC AGCACATGAC ACCGATGCCA AACAGTCACG AATGCGGGAA
CGTATCGCCG GTTTCGGCGA ACGGTCGGGC GAAGTAACGC TCGACGGTAT TCAGGTCCCA
CAGATCGTCG CCGAAAACAC AGACGTCATG TCTGTCGAGG ACGAGCCCGC GACACTGGAT
CGGGACGAGA CGATTCCCGT GGTTGACGGA GACGATCACC CACCAGACGC GTACATCGAG
GTGCTCGTCG ATGGTTTTCG GGAGGCGTAC GAAACTGTCC TCGACCTCAG AGACTCGGGC
GCGTTGGAGG AGTCGATCGC CGTTTTCGAT CGATTCGAGG GCCTCAGGAA CCGGCTGGTG
TACCGGCCGA CGATGGAGTA CGCGAAGGTG TTACGCGATC TCACTTCCAG GGACTGCCTG
GGCGACGGCG TCCGTTTCGG CGTCGAGCTA GAGCAGTTGT CGACGCCGTT TTTCGACGGG
AGCATCACCG ACCGAAAGCC GTGGGCACTG TACGAGGCGG AGCGGACGGC GCTCAGACGC
CTCGATCCGC CTCGGTTCAC CTCGCGGACG GACGAACACG AAATAGAGTG GGGCGGTACG
GGTCTCGGCA TCACGGCGAG CCAAACCGGT TTCGAGCGCG CCAGAGAGCG AATCGAACGG
GCCGACCGGT CCGAACTCCG GAAACAACTC GAGATCATCC GGGGATGTTA CGGGGAACCG
CCCCAGGGAG AGGTACACGC GGACGTATCG GGAACGTCGC CGCAACGACC CAGTAACGAC
GATGCGTTCC TCCGCGAGAG CAAGCGGTTG CTCCGAACCG TCCGGTCTAG TGCTCGTGAG
ACTGCGGACG GACACTACCA GTGGGCGAGC ATCGCACCCT ATCACGAGAC CGACCGCTTG
ACCATCCAGC CGGCCGGTGG TTCGCTGTAC GTCGGTGGTG TCGGTATCGG TCTACTGGGA
GCGGCACTGT TTGCCGTGGA CGGCGACAGT CGATACGCGG AGTTCGCTCG TTCGGCCGTC
GGTCCGATTC GCGATGCGGT GCGCACCCAG CGGGAGGTCC CGGCGTTCGA AAACCTCGGG
GGGGCCCTCG GTGTCGGGTC GGTGGCGTAC GGGCTGAGTG TTATCGGAAC CATGATCGAC
GACCGAGAGA TGCTCGAGGA TGCAACGCGA GTCGCCAACC GGTTTCCCGA AGCGCGGATC
GCCGAAGACG AAACCTACGA CGCTGTCGGT GGGTCTGCAG GCACTATACT GGGGCTTCTC
GGGGCGTACA ACCGGATCGA GTCGCCGGAA CTGCTGTCGC TGGCCGAGGC GTGTGGTGAC
AGGTTACTGG ACGCGAGACA GACCCTGGAC GGCGTCGGCG TCTGGAAAAC GGTGCCGGAC
TGTCCACCTC TGACGGGGAT GGCACACGGT GTTAGCGGCA TCGCGTACGC GTTGGTTCGG
TTGTGGGACG CGACGGGCAA TCAATCATAT CTCGATGCGG CGACTGAGGC ACTGGCGTAC
GAACGCGACG CGTACGTCCC TGAGGCAGAC AACTGGATCG ACTATCGGCC GTGGACCGAC
AGGCACCCCG ACCAGTGGTG TTACGGCCGA AGCGGGATCG GGCTCGCACG GCTGGGCATG
GCAGACTACC TCACAGATCC GTCGATAGAG CGCGACATCG AACGTGCGAC GTCGGGGCTC
TCCGACGTTC AGACCACGAC TGTCGACCAC CTCTGCTGTG GCTCTGCTGG CCGAGCGGCG
TTTCTCCTGG CGCTCCAGCG TCGACGCCAG CGCCACGAGG GCGCTGCTCG CCGCACCCTC
GGTGACGTCC TCGGCAGTCG GCGTGCCAAC GGCCACTATC GGACGCTGTC GGAAACTGCC
GAAATTGTTG ACCCGACCTT CTTCCAAGGC GTCTCCGGGA TCGGATACGC CTACCTACGA
CTCTGTGACC CAGACGAACT GCCGTGTATC CTGCTGTGGG AGTGA
 
Protein sequence
MTAVFTDAEK RSIVGQARTL HERVQTLDEY DQDVEHDERI EALWDEWRSQ FPSDGSFEDR 
IEWSDVSESE WRRAIEVETL NEGASVPAWV DRLEAAVAAI QDRSPERADT RFDVDTDERR
LLGELTAALA DYVCDRVPAE VEEALTDQAI AEMGEWFRTR FQNRFSRILF VEFKSFVAAH
DRELAFADPN DFDEPPSEYY DRFVAYLFDG GFVDLCQEYP VFARLLVTQI RQWQRHLDVF
CERLRSDRDL LCDRFGSGDD LGPVVSVKPL ADDTHGDGRA VMRVTFDAGL SVVYKPRSVA
AGEALYDTLE AIDEHLSCPS FDTPTYLDRD AYGWMEWIDH EPCQDDSEVE RYYRRAGVWL
CLAHLFEFSD CHFENVKAAG DQPLLVDSET VFHPYFDAER RPGSGDIGTL TDDSTLLTSL
LPYDVTSAHD TDAKQSRMRE RIAGFGERSG EVTLDGIQVP QIVAENTDVM SVEDEPATLD
RDETIPVVDG DDHPPDAYIE VLVDGFREAY ETVLDLRDSG ALEESIAVFD RFEGLRNRLV
YRPTMEYAKV LRDLTSRDCL GDGVRFGVEL EQLSTPFFDG SITDRKPWAL YEAERTALRR
LDPPRFTSRT DEHEIEWGGT GLGITASQTG FERARERIER ADRSELRKQL EIIRGCYGEP
PQGEVHADVS GTSPQRPSND DAFLRESKRL LRTVRSSARE TADGHYQWAS IAPYHETDRL
TIQPAGGSLY VGGVGIGLLG AALFAVDGDS RYAEFARSAV GPIRDAVRTQ REVPAFENLG
GALGVGSVAY GLSVIGTMID DREMLEDATR VANRFPEARI AEDETYDAVG GSAGTILGLL
GAYNRIESPE LLSLAEACGD RLLDARQTLD GVGVWKTVPD CPPLTGMAHG VSGIAYALVR
LWDATGNQSY LDAATEALAY ERDAYVPEAD NWIDYRPWTD RHPDQWCYGR SGIGLARLGM
ADYLTDPSIE RDIERATSGL SDVQTTTVDH LCCGSAGRAA FLLALQRRRQ RHEGAARRTL
GDVLGSRRAN GHYRTLSETA EIVDPTFFQG VSGIGYAYLR LCDPDELPCI LLWE