Gene Hlac_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1683 
Symbol 
ID7400440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1702296 
End bp1704167 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content70% 
IMG OID643708752 
Productpeptidase M50 
Protein accessionYP_002566338 
Protein GI222480101 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.200321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCC TGACGGACCT GTTCGCCGAG AACACGCTGT TGTGGGTCCT CACGGGGGTC 
CTCGCGTACT CCGCGGCCGC GATGTGGCTT CGGAACCGCG GGGCCCTCCC CGACTCTGTC
CGCGTCTCCG GACCTGTATT GACCCTCCGC ACCCTCCGTG GTCGAGCGTT CCTCGACCGG
CTCGCCGCGC CGAAGCGTCT CTGGCGGGCC GTCGCGAACC TCGGACTCGG CGGTGCGCTC
GTCGCGATGG TGGGATCGTT CATCCTCATC CTCTCATCGG CGCTCTCGGC GATTCGCACC
GTCCAGCCCT CCGCGATCCA ACAGCCGCAG AACTTCCTCA TTATCCCCGG CGTGAACGAC
TTCCTCCCCC TGTCGGTCGC ACCCGAGATC GTCGCCGGGC TCGCGGTCGC GATGGTCGTC
CACGAGGGGG CACACGGCCT GCTGTGCCGC GTCGAGGGGA TCGACATCGA GTCGATGGGG
CTCGTCTTCT TCACGTTCCT CCCCGTCGGC GCCTTCGTCG AGCCGAACGA GGAGGCGACC
CAGGAGGTCT CCCGCGGCGC CCGTGCCCGG ATGTTCGCCG CCGGCGTCAC CGCGAACACG
GTCCTCACGG TCCTCGTCTT CGCGCTGCTT TTCGGTCCGG TCGTCGGCGC CATCGCGCCG
GCGCCCGGCT ACGCCGTTGG CGAGGTGACC CCCGAGTCGC CGGCCGCAGC GGCCGATATC
GCGCACGGCG ACCGGTTGGT CGCGGTCGCG GGGACGCCGG TCGACACCGC GGACGAGTTC
GAGGCCGCGA TCGCCGACGC CGGCGAGACG GTGAGCGTCA CGGCCGACGA CGGTGACGGA
GAGCGGACCG TCGAGGTTGA GCGCCGGCTC CACGTGATCG GGTCCGCAGG TGGGAACCCG
CTCGGCGTGA CGATCGAGTC GGAGCCGGTG GCGATCACGT CCGTCAACGG CGAGTCCGTG
GCGACCGAGC AACAGTTCCT CGACGCCGTC GGCGACGCCG AGCGCGCGAC GGTGACCGTC
GACGCCGACG GCGCGGCCAA CGCCACGATC GATTCCGAGA CGACCGAGAT CCCGATCGGT
GCGTACGCCC TCGGCGTGCA GGAGGACGGG CCGCTTCACG CCGAGGGTGC GCCCCTCGGT
GAGCCGCTGA CGATCGTCTC GATCGACGGG GAGCGCGTCC GGAATAACGA CGAGCTCTCG
GCAGTGCTCG GCGAGCGCGA GCCCGGAACG ACCGCCGAGG TCGTCGCGTA CGACGCCGAC
GACGAGCGCG TCAGCTACGA CGTTGAGCTC GACTCGCACC CGAACCGCGA CGGCGGGTTC
ATCGGCGTGG GCGTCTTCCC CGGCTCCAGC GGACTCGCGC TCGACGACTT CGGCGTCTCG
GAGTACCCGG CCGGCGCGTA CCTCGAACTG CTCGGCGGCG ACGGCGGCGA GGGTGCGACC
AACGGGCTCG CGCTCACCGG GCTCACGGAC TCCCCGCTCG GGCTCGTGTT CGCCTCGCTC
ATCCTCCCGC TCGGCAGCCT GTTCGGGCTC CCGTTCAACT TCGCGGGATT CACCGGCGAG
ATGACCAACT TCTACGTGGT TGAGGGCGCG TTCGCCGCCT TCGGCGGCGG GACGTTCCTG
CTCGCGAACC TCCTCTTCTG GACCGGGTGG ATCAACATCC AGCTCGCGCT GTTCAACTGC
CTGCCGGCGT TCCCGCTCGA CGGGGGCCGC ATCCTTCGGA TGGTCGCCGA GGCGGTGGTC
AGCCGGGTCC CGATCTCCGA CCGGCATGCC GCGGTCCGAA CGATCACAGT CTCCTCCGGG
CTCGTGATGC TCGCCGGGCT GATCATGATG ATCTTCGGCA ACCAGATCCT CGGGGCGCTC
GGGCTGCTCT AG
 
Protein sequence
MNALTDLFAE NTLLWVLTGV LAYSAAAMWL RNRGALPDSV RVSGPVLTLR TLRGRAFLDR 
LAAPKRLWRA VANLGLGGAL VAMVGSFILI LSSALSAIRT VQPSAIQQPQ NFLIIPGVND
FLPLSVAPEI VAGLAVAMVV HEGAHGLLCR VEGIDIESMG LVFFTFLPVG AFVEPNEEAT
QEVSRGARAR MFAAGVTANT VLTVLVFALL FGPVVGAIAP APGYAVGEVT PESPAAAADI
AHGDRLVAVA GTPVDTADEF EAAIADAGET VSVTADDGDG ERTVEVERRL HVIGSAGGNP
LGVTIESEPV AITSVNGESV ATEQQFLDAV GDAERATVTV DADGAANATI DSETTEIPIG
AYALGVQEDG PLHAEGAPLG EPLTIVSIDG ERVRNNDELS AVLGEREPGT TAEVVAYDAD
DERVSYDVEL DSHPNRDGGF IGVGVFPGSS GLALDDFGVS EYPAGAYLEL LGGDGGEGAT
NGLALTGLTD SPLGLVFASL ILPLGSLFGL PFNFAGFTGE MTNFYVVEGA FAAFGGGTFL
LANLLFWTGW INIQLALFNC LPAFPLDGGR ILRMVAEAVV SRVPISDRHA AVRTITVSSG
LVMLAGLIMM IFGNQILGAL GLL