Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2234 |
Symbol | |
ID | 7399943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2219556 |
End bp | 2220620 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643709307 |
Product | peptidase M42 family protein |
Protein accession | YP_002566881 |
Protein GI | 222480644 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.323516 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGACT CACAGCGAGC GTTCCTCAAC GACCTACTCG CCACCGCTAG CCCCTCCGGC TTCGAGACGC CGAGCCAGCG AGTCTGGACC GACTACGTTC GCGGCTTCGC CGACGAGGTC TCCGTCGACG CCTACGGCAA CGCCGTCGCC GTTCACGAGG GCGACCCCGA CGCGCCCACC ATCGCCCTGA CCGGCCACGC CGACGAGATC GGATTCATCG TCCGCGACGT GCTCGACGAC GGTTTCCTGC GGATCTCCCG GATCGGCGGC TCCGACCGCA CCGTCTCGAA GGGCCAGCAC GTCACCGTCC ACGCCGACGA GCCGGTGCAG GGCGTGATCG GTCAGACCGC GATCCACCTG CGGGACCGCT CGGAAGACGA GTACGAGAAG ATCGCCGAGC AGTTCGTCGA CATCGGCGCG GCTGACGCCG AAGAGGCGCG CGAGTGCGTC GAGATCGGCG ATCCCGTCAC ATTCTCGACC GAGGTGGAAG AGCTGGTTGG CGACCGGATC GCCGCCCGCG GTATCGACAA CCGGACCGGC ACGTGGGCAG CCGCGGAAGG GCTCCGCCGC GCGACCGAGC GTGACATCGA CGCCACCGTC TACGCCATTT CCACGGTACA GGAGGAGGTC GGGCTCCAGG GCGCCCAGAT GGTCGGCGTC GACCTCGAGA CGGTGGACGC GTTCGTCGCC GTCGACGTCA CTCACGCCAC CGATAACCCC GATGTCGACG GAGAACACCG AGGCCCGGTC GAGCTCGGCT CCGGACCCGT GATCGCCCGT GGCAGCGCGA ACCACCCCGT CCTCGTCGAC CTCGCGCGCG ACGCCGCGGC CGCTGCCGAC ATCGACGTAC AGCTACAGGC GGCCGGCACG CGAACTGGTA CCGACGCCGA CGCCTTCTAC ACCGTTCAGG GCGGTGTCCC GTCGCTCAAC GTCTCGATCC CGAACCGCTA CATGCACACC CCGGTCGAAG TGGTCGACAT CGCCGACCTC GATGCCGTCG CCGATCTCCT CGCCGCGATC GCCGACGGCG CGGGCGACGC CACGCCCTTC GCCGTCGACG TGTGA
|
Protein sequence | MRDSQRAFLN DLLATASPSG FETPSQRVWT DYVRGFADEV SVDAYGNAVA VHEGDPDAPT IALTGHADEI GFIVRDVLDD GFLRISRIGG SDRTVSKGQH VTVHADEPVQ GVIGQTAIHL RDRSEDEYEK IAEQFVDIGA ADAEEARECV EIGDPVTFST EVEELVGDRI AARGIDNRTG TWAAAEGLRR ATERDIDATV YAISTVQEEV GLQGAQMVGV DLETVDAFVA VDVTHATDNP DVDGEHRGPV ELGSGPVIAR GSANHPVLVD LARDAAAAAD IDVQLQAAGT RTGTDADAFY TVQGGVPSLN VSIPNRYMHT PVEVVDIADL DAVADLLAAI ADGAGDATPF AVDV
|
| |