Gene Hlac_2234 details

Gene Information

Locus tag: Hlac_2234
Symbol:
ID: 7399943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2219556
End bp: 2220620
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 69%
IMG OID: 643709307
Product: peptidase M42 family protein
Protein accession: YP_002566881
Protein GI: 222480644
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
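
The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below is a minimal illustration in Python, assuming that Start bp and End bp are 1-based inclusive positions (the record does not say so) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2219556, 2220620)
print(length, protein_length_aa(length))
```

Under these assumptions the coordinates give 1065 bp and 354 aa, which agrees with the Gene Length and Protein Length fields.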


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.323516
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGACT CACAGCGAGC GTTCCTCAAC GACCTACTCG CCACCGCTAG CCCCTCCGGC 
TTCGAGACGC CGAGCCAGCG AGTCTGGACC GACTACGTTC GCGGCTTCGC CGACGAGGTC
TCCGTCGACG CCTACGGCAA CGCCGTCGCC GTTCACGAGG GCGACCCCGA CGCGCCCACC
ATCGCCCTGA CCGGCCACGC CGACGAGATC GGATTCATCG TCCGCGACGT GCTCGACGAC
GGTTTCCTGC GGATCTCCCG GATCGGCGGC TCCGACCGCA CCGTCTCGAA GGGCCAGCAC
GTCACCGTCC ACGCCGACGA GCCGGTGCAG GGCGTGATCG GTCAGACCGC GATCCACCTG
CGGGACCGCT CGGAAGACGA GTACGAGAAG ATCGCCGAGC AGTTCGTCGA CATCGGCGCG
GCTGACGCCG AAGAGGCGCG CGAGTGCGTC GAGATCGGCG ATCCCGTCAC ATTCTCGACC
GAGGTGGAAG AGCTGGTTGG CGACCGGATC GCCGCCCGCG GTATCGACAA CCGGACCGGC
ACGTGGGCAG CCGCGGAAGG GCTCCGCCGC GCGACCGAGC GTGACATCGA CGCCACCGTC
TACGCCATTT CCACGGTACA GGAGGAGGTC GGGCTCCAGG GCGCCCAGAT GGTCGGCGTC
GACCTCGAGA CGGTGGACGC GTTCGTCGCC GTCGACGTCA CTCACGCCAC CGATAACCCC
GATGTCGACG GAGAACACCG AGGCCCGGTC GAGCTCGGCT CCGGACCCGT GATCGCCCGT
GGCAGCGCGA ACCACCCCGT CCTCGTCGAC CTCGCGCGCG ACGCCGCGGC CGCTGCCGAC
ATCGACGTAC AGCTACAGGC GGCCGGCACG CGAACTGGTA CCGACGCCGA CGCCTTCTAC
ACCGTTCAGG GCGGTGTCCC GTCGCTCAAC GTCTCGATCC CGAACCGCTA CATGCACACC
CCGGTCGAAG TGGTCGACAT CGCCGACCTC GATGCCGTCG CCGATCTCCT CGCCGCGATC
GCCGACGGCG CGGGCGACGC CACGCCCTTC GCCGTCGACG TGTGA
 
Protein sequence
MRDSQRAFLN DLLATASPSG FETPSQRVWT DYVRGFADEV SVDAYGNAVA VHEGDPDAPT 
IALTGHADEI GFIVRDVLDD GFLRISRIGG SDRTVSKGQH VTVHADEPVQ GVIGQTAIHL
RDRSEDEYEK IAEQFVDIGA ADAEEARECV EIGDPVTFST EVEELVGDRI AARGIDNRTG
TWAAAEGLRR ATERDIDATV YAISTVQEEV GLQGAQMVGV DLETVDAFVA VDVTHATDNP
DVDGEHRGPV ELGSGPVIAR GSANHPVLVD LARDAAAAAD IDVQLQAAGT RTGTDADAFY
TVQGGVPSLN VSIPNRYMHT PVEVVDIADL DAVADLLAAI ADGAGDATPF AVDV
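
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the tool that produced this record. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). The helper names `clean`, `gc_percent`, and `translate` are illustrative, and only the first 30 and last 45 nucleotides of the gene are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same codon-to-residue mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1; stop codons are written as '*'."""
    s = clean(seq)
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s) - 2, 3))


# Excerpts copied from the gene sequence above.
first_30 = "ATGCGTGACT CACAGCGAGC GTTCCTCAAC"
last_45 = "GCCGACGGCG CGGGCGACGC CACGCCCTTC GCCGTCGACG TGTGA"

print(translate(first_30))
print(translate(last_45))
```

The first excerpt translates to MRDSQRAFLN and the second to ADGAGDATPFAVDV followed by a stop, matching the start and end of the protein sequence. Passing the full gene sequence to `gc_percent` would allow the GC content field to be checked in the same way.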