Gene Information |
Locus tag | Hlac_0724 |
Symbol | |
ID | 7400197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 739979 |
End bp | 741031 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643707790 |
Product | peptidase M42 family protein |
Protein accession | YP_002565396 |
Protein GI | 222479159 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
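The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length counts the stop codon. Both assumptions fit this record but are not stated in it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Hlac_0724 record.
length = gene_length_bp(739979, 741031)
print(length)                     # 1053
print(protein_length_aa(length))  # 350
```

The strand field does not affect this check: a gene on the minus strand still spans the same interval, and only the direction in which it is read changes.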
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.143036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.785387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTCG AGTTTGATTA CGATCTGCTC CGTGAGTTGA CTGAGGCCCG AGGCGTCCCG GGATACGAAG ACGAAGTCCG CGAGATCGTC CGCCGCGAGT TCGCCGATCG CGCCGACCGC GTTCGCACCG ACGCGATGGG AAACGTCGTC GCCACGCTCG AGGGTGACTC TGATTACTCG GTCGCCGTCG CGGCCCACAT GGATGAAATC GGCTTCATGG TCCGACACGT CACCGACGAG GGGTTCGTCC AGGTGGATCC GCTCGGTGGG TTCGACGCCC GGGTGCTGCG CGCACAGCGC GTCACCGTCC ACGGCGAGGA GGATCTCACC GGCGTCATCG GCTCCGTCCC GCCGCACACG CTCACGGACG AGCAGAAGGA GAAGGATGAC GAGGTCTCGG ACGTGTTCAT CGACGTCGGG CGCGACGCCG AGGCGGTCGA AGAACTCGTC GGCGTCGGCG ATCTGGTCAC CCTCGATCAG ACGACGACCC GCATGGGCGA TCGGATCACG GGGAAGGCGC TCGACGACCG GATCTGCCTG TTCGCGACGC TTGAGGCCGC AAAGCGAATC GAGGATCCCG ACGTGACGAT CCACTTCGCG GCGACGGTTC AAGAGGAGGT CGGGATCCGC GGCGCGACCG CACTCGGCGT CGACATCGAC CCCGACCTCG CGATCGCCTT GGACGTGACC GTCGCGAACG ACGTACCCCA GATCGGCGAA CCGGCCGACG CCGTGACGGA GCTCGGCGAG GGGACCGCGA TCAAACTGAA AGACTCGTCG GTGATCACCA GCCCGAAGGT CCACAAGCGG CTCACTGCGG TCGCCGAAGC GGAGGCGATC GATCACCAAC ACGAGGTGTT GCCCGCGGGC GGCACCGACA CCGCCGGGTT TCAGAATACT GCCGGTGCAA AGCCTGTCGG CGCCATCTCG ATCCCGACGC GGTACCTCCA CACCGTCACC GAAACCGCCG ACGGCGACGA CGTGGCCGCG ACGATCGACC TGCTGACGGC CTTTTTGGAG TCCGAGTCCG GAGAACACGA CTACACGCTG TAG
|
Protein sequence | MSFEFDYDLL RELTEARGVP GYEDEVREIV RREFADRADR VRTDAMGNVV ATLEGDSDYS VAVAAHMDEI GFMVRHVTDE GFVQVDPLGG FDARVLRAQR VTVHGEEDLT GVIGSVPPHT LTDEQKEKDD EVSDVFIDVG RDAEAVEELV GVGDLVTLDQ TTTRMGDRIT GKALDDRICL FATLEAAKRI EDPDVTIHFA ATVQEEVGIR GATALGVDID PDLAIALDVT VANDVPQIGE PADAVTELGE GTAIKLKDSS VITSPKVHKR LTAVAEAEAI DHQHEVLPAG GTDTAGFQNT AGAKPVGAIS IPTRYLHTVT ETADGDDVAA TIDLLTAFLE SESGEHDYTL
|
| |
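The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a record might be processed follows: it joins the blocks, computes a GC fraction, and translates codons until the first stop. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(blocked: str) -> str:
    """Remove the display spacing from a blocked sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
gene_start = join_blocks("ATGTCGTTCG AGTTTGATTA CGATCTGCTC")
print(translate(gene_start))  # MSFEFDYDLL
```

The ten residues produced match the first block of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence would give a value comparable to the GC content field.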