Gene Hlac_0724 details


Gene Information

Locus tag: Hlac_0724
Symbol:
ID: 7400197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 739979
End bp: 741031
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 66%
IMG OID: 643707790
Product: peptidase M42 family protein
Protein accession: YP_002565396
Protein GI: 222479159
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
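
The length fields in the record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported 1053 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 739979
END_BP = 741031


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1053 350
```

Both values agree with the Gene Length and Protein Length fields.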


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.143036
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.785387
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTCG AGTTTGATTA CGATCTGCTC CGTGAGTTGA CTGAGGCCCG AGGCGTCCCG 
GGATACGAAG ACGAAGTCCG CGAGATCGTC CGCCGCGAGT TCGCCGATCG CGCCGACCGC
GTTCGCACCG ACGCGATGGG AAACGTCGTC GCCACGCTCG AGGGTGACTC TGATTACTCG
GTCGCCGTCG CGGCCCACAT GGATGAAATC GGCTTCATGG TCCGACACGT CACCGACGAG
GGGTTCGTCC AGGTGGATCC GCTCGGTGGG TTCGACGCCC GGGTGCTGCG CGCACAGCGC
GTCACCGTCC ACGGCGAGGA GGATCTCACC GGCGTCATCG GCTCCGTCCC GCCGCACACG
CTCACGGACG AGCAGAAGGA GAAGGATGAC GAGGTCTCGG ACGTGTTCAT CGACGTCGGG
CGCGACGCCG AGGCGGTCGA AGAACTCGTC GGCGTCGGCG ATCTGGTCAC CCTCGATCAG
ACGACGACCC GCATGGGCGA TCGGATCACG GGGAAGGCGC TCGACGACCG GATCTGCCTG
TTCGCGACGC TTGAGGCCGC AAAGCGAATC GAGGATCCCG ACGTGACGAT CCACTTCGCG
GCGACGGTTC AAGAGGAGGT CGGGATCCGC GGCGCGACCG CACTCGGCGT CGACATCGAC
CCCGACCTCG CGATCGCCTT GGACGTGACC GTCGCGAACG ACGTACCCCA GATCGGCGAA
CCGGCCGACG CCGTGACGGA GCTCGGCGAG GGGACCGCGA TCAAACTGAA AGACTCGTCG
GTGATCACCA GCCCGAAGGT CCACAAGCGG CTCACTGCGG TCGCCGAAGC GGAGGCGATC
GATCACCAAC ACGAGGTGTT GCCCGCGGGC GGCACCGACA CCGCCGGGTT TCAGAATACT
GCCGGTGCAA AGCCTGTCGG CGCCATCTCG ATCCCGACGC GGTACCTCCA CACCGTCACC
GAAACCGCCG ACGGCGACGA CGTGGCCGCG ACGATCGACC TGCTGACGGC CTTTTTGGAG
TCCGAGTCCG GAGAACACGA CTACACGCTG TAG
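
The GC content field can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted in the same layout as on this page (blocks of 10 bases separated by spaces), so whitespace is stripped first. Only the first row is used as sample input here; the full 1053 bp sequence would need to be supplied to compare against the reported 66%. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as displayed.
first_row = "ATGTCGTTCG AGTTTGATTA CGATCTGCTC CGTGAGTTGA CTGAGGCCCG AGGCGTCCCG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```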
 
Protein sequence
MSFEFDYDLL RELTEARGVP GYEDEVREIV RREFADRADR VRTDAMGNVV ATLEGDSDYS 
VAVAAHMDEI GFMVRHVTDE GFVQVDPLGG FDARVLRAQR VTVHGEEDLT GVIGSVPPHT
LTDEQKEKDD EVSDVFIDVG RDAEAVEELV GVGDLVTLDQ TTTRMGDRIT GKALDDRICL
FATLEAAKRI EDPDVTIHFA ATVQEEVGIR GATALGVDID PDLAIALDVT VANDVPQIGE
PADAVTELGE GTAIKLKDSS VITSPKVHKR LTAVAEAEAI DHQHEVLPAG GTDTAGFQNT
AGAKPVGAIS IPTRYLHTVT ETADGDDVAA TIDLLTAFLE SESGEHDYTL
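
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact 64-letter string used for NCBI translation table 11, whose amino-acid assignments are the same as the standard code (the tables differ only in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper is illustrative, and it assumes the input is in frame and contains only A, C, G and T.

```python
from itertools import product

# Codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...) paired with the
# amino-acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied as displayed.
first_row = "ATGTCGTTCG AGTTTGATTA CGATCTGCTC CGTGAGTTGA CTGAGGCCCG AGGCGTCCCG"
last_row = "TCCGAGTCCG GAGAACACGA CTACACGCTG TAG"

print(translate(first_row))  # MSFEFDYDLLRELTEARGVP
print(translate(last_row))   # SESGEHDYTL
```

The two outputs match the first 20 and last 10 residues of the protein sequence. The last row can be translated on its own because the 17 rows before it hold 1020 bases, a multiple of 3, so it starts in frame.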