Gene Hlac_2601 details

Gene Information

Locus tag: Hlac_2601
Symbol:
ID: 7399827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2578912
End bp: 2580096
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 67%
IMG OID: 643709674
Product: peptidase M50
Protein accession: YP_002567243
Protein GI: 222481006
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
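
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon and no splicing."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2578912, 2580096)
    print(length)                     # 1185
    print(protein_length_aa(length))  # 394
```

Under these assumptions the values agree with the Gene Length (1185 bp) and Protein Length (394 aa) fields of this record.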


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.603215
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.812434
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGTGGAA TCAAGATCGG GACGGTGCTG GGGATCCCGG TCAGGCTCAA CTGGACGTTT 
CTGATCGTAC TGCCGCTTTT CGCCTACCTC ATCGGCTCGC AGGTCGGGAT GATCGCCGAG
GTGATGAACG AGGCGTTCAG CGCCGGCATC GACCCCGCCG CGCTCGGTGC GGGGCTCACG
CCGTGGGCAC TCGGGTTGGC GGCCGCACTC GGGCTGTTCG GCGGCGTCCT CCTCCACGAG
TTCGGCCACT CGATCGTCGC CATGCGGTAC GGGTACGAGA TCGAGTCGAT CACCCTGTGG
CTGCTCGGCG GGATCGCCAG CTTCACCGAG TTCCCCGAGG ACTGGAAACA CGAGTTCTGG
ATCGCGATCG CGGGACCGGT CGTCAGCGTC GCCGTCGGGC TCGTCTGTTA CGGCGTGTTC
GTGCTCGCGC CGCTCGGCTC GAACGCCGTG TTGTTCGTCT TCGGCTACCT CGCGCTGTTG
AACATCGTGC TCGCGGTGTT CAACATGCTT CCCGCCTTCC CGATGGACGG CGGGCGCGTC
CTTCGGGCGC TCCTCGCGCG GAACCAGCCG CACGCGCAGG CGACCCAGCG CGCAGCCGCG
ATCGGGAAGG TGTTCGCCTT CTTCATGGGA CTGATCGGAC TGTTCACCTT CCAGCTCCTG
CTGATCGTGT TGGCCTTCTT CATCTACATC GCCGCCTCCG GCGAGGCCCA GCAGACGACG
CTGAAGGCCG CCTTCGAGGA CGTCACCGTC GCCGACGTGA TGACCCGCCG CGAGGACCTC
CACACCGTCA CCGGAGACAC CTCTGTCGCG GATCTGATGA GCCGGATGTT CGAGGAGCGC
CACACCGGCT ACCCCGTGCT CCACGGCGGC AACCTCGTCG GGATGGTGAC CTTAGAGGAC
GCCCGATCGG TCCGGGATGT CGAGCGGGAC GCCTACCAGG TCGCAGACGT GATGGAGACC
GAAGTGGTCG GCGTCGGTCC CGAGGCCGAC GCGATGACCG CGCTCCAGAC GATGCAGGAG
AACGGCGTCG GCCGGCTCCC GGTCGTCGAT CGGAGCGACG AGCTGGTCGG ACTCATCTCC
CGTTCGGACC TGATGACCGC GTTCAACATC ATCCAGACGG GTGGCACTCC GAGCCTCATC
AGCGGACGCC GACAGGGGGC CGAAGGCGGC CCCGGCGTGT TCTGA
 
Protein sequence
MRGIKIGTVL GIPVRLNWTF LIVLPLFAYL IGSQVGMIAE VMNEAFSAGI DPAALGAGLT 
PWALGLAAAL GLFGGVLLHE FGHSIVAMRY GYEIESITLW LLGGIASFTE FPEDWKHEFW
IAIAGPVVSV AVGLVCYGVF VLAPLGSNAV LFVFGYLALL NIVLAVFNML PAFPMDGGRV
LRALLARNQP HAQATQRAAA IGKVFAFFMG LIGLFTFQLL LIVLAFFIYI AASGEAQQTT
LKAAFEDVTV ADVMTRREDL HTVTGDTSVA DLMSRMFEER HTGYPVLHGG NLVGMVTLED
ARSVRDVERD AYQVADVMET EVVGVGPEAD AMTALQTMQE NGVGRLPVVD RSDELVGLIS
RSDLMTAFNI IQTGGTPSLI SGRRQGAEGG PGVF
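
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace removed before use. The sketch below shows one way to do that and to recompute the GC content and the translation. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. Alternative start codons are not handled here, which is an acceptable simplification for this gene because it begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence listed above.
    print(translate("ATGCGTGGAA TCAAGATCGG"))  # MRGIKI
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence, and `gc_percent` with the GC content field, gives a quick consistency check of the record.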