Gene Information |
Locus tag | Hlac_2488 |
Symbol | |
ID | 7401540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2465762 |
End bp | 2467177 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643709560 |
Product | glycoside hydrolase family 68 |
Protein accession | YP_002567131 |
Protein GI | 222480894 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
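The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2465762
END_BP = 2467177
GENE_LENGTH_BP = 1416
PROTEIN_LENGTH_AA = 471


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming an unspliced CDS with one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: 2467177 - 2465762 + 1 = 1416 bp, and 1416 / 3 - 1 = 471 aa.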
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.266603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.590215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGAGA CGCCGGAGAG CGACGGCGGT CGATCGCGCT CGCGGTGGAC TAGAGAGCAG GCCGCGTCGA TCGAGCGCCG CCGCGGGAAC ATCGCCCCGC CGGCGGGTGC CCCCGAGCTC GACCCCTTCC CGGATCTCCA CGTCTGGGAC ACGTGGCTGC TCCGCGACCG ACACGGCGAG ATTGCCGACG TGAACGGGTA CCGACTGGCG TTCTCGCTGA CTGCCCCGGC TGACCTGCTC CCCGGAAAAC GCCACGATGT GGCCGAGATC CGGTGTTTCT ACTCCGCTGA CGGGAAGCGA TGGCACGACG CCGGACCGGT CTTCGACGGC GGCGCGCTCG GCCAGCGCCA GTGGGCCGGC TCCGCGCTGT ACGACGACGG CGAGGTCTAC CTCTACTACA CCGCCGCCGG CGACGAGGCC GCCGACGAGA TGACGTACAC TCAGCGGATC GCGGTCGCGC ACGGCGGAAC GGCGAGCGCC GACGAGGACG GGATCGAACT CTCCGGTCCG TGGACTCACG AGACGCTACT GACGCCGGAC GGCGAGTGGT ACGAGACCGA GGCACAGTCG CGCGGGATGA CCTACACCTT CCGGGACCCG TGGTTCTTCG AGGACTCGGC GACCGGCGAG ACGCACCTGC TGTTCGAGGC GAACGCGCCC GCGCCGGAGC GACCGGGCGA CGACGAGGCG ACCGCCCACC GCCGGGAGTT CAACGGCTGC GTCGGCGTCG CCGTCTCGGA GTCGGGCGAC CCGCTCTCGT GGGAGCTTCG CCCGCCCCTG CTGGACGCGG TCGAAGTCAA TCAGGAACTG GAGCGCCCGC ACGTCGTCGT CGCCGACGGG CGCTACTACC TGTTCGTCTG CAGCCACGTC CACACGTTCG CGCCGGGCGT GACCGGGCCG GACGGGCTCT ACGGCTTCGT CGCCGACGCG CTCGACGGCG AGTACCGCCC CCTGAACGGC TCCGGGCTCG TCGCCACGAA CCCGCCCGAA GCGCCGTTTC AGGCGTACTC GTGGATGGCG TTCGCCCACG ACGAGGAGGT GCTCGTCCAG AGCTTCCTCA ACTACTACGA CTTCGCGGGC GACTCGCTCG ACGCGATCGC CGACCTCCCC GAGGCCGAGC AGCGCGAGCG GTTCGGCGGG ACGCTCGCCC CGACCCTCCG ACTCGCGCTC GACGGCGACA GCACCCGGCT GCGCGGAACG CTCGACGCGT GGCGTATTCC CACCCCGGAC GAGCCGCTAC CGCCGGCCGA CGATTCGGAA CTCCCGGGCG GCGACGCGCT CAGCGGTCGG CTCCGAGAAG GCGGGAGCGG CGGCGGGTAC GCCGGGGGAC CGAGCGGCGC GGTCGACAGC GAAGGCGATA ACTCGCTCGG CGACGCCGAA ACGGGAGACG ACGGCGGCTC TCACGGCGCT ATTTAA
|
Protein sequence | MHETPESDGG RSRSRWTREQ AASIERRRGN IAPPAGAPEL DPFPDLHVWD TWLLRDRHGE IADVNGYRLA FSLTAPADLL PGKRHDVAEI RCFYSADGKR WHDAGPVFDG GALGQRQWAG SALYDDGEVY LYYTAAGDEA ADEMTYTQRI AVAHGGTASA DEDGIELSGP WTHETLLTPD GEWYETEAQS RGMTYTFRDP WFFEDSATGE THLLFEANAP APERPGDDEA TAHRREFNGC VGVAVSESGD PLSWELRPPL LDAVEVNQEL ERPHVVVADG RYYLFVCSHV HTFAPGVTGP DGLYGFVADA LDGEYRPLNG SGLVATNPPE APFQAYSWMA FAHDEEVLVQ SFLNYYDFAG DSLDAIADLP EAEQRERFGG TLAPTLRLAL DGDSTRLRGT LDAWRIPTPD EPLPPADDSE LPGGDALSGR LREGGSGGGY AGGPSGAVDS EGDNSLGDAE TGDDGGSHGA I
|
| |
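The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a rough sketch of how that could be done, together with a simple GC fraction and a stop-codon check; the helper names are hypothetical, the sample string is only the first two blocks of the gene sequence, and the GC fraction it yields is for that sample alone, not the value reported for the whole gene.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize_sequence(grouped: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop_codon(seq: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, used here only as a short sample.
sample = normalize_sequence("ATGCACGAGA CGCCGGAGAG")
print(sample, round(gc_fraction(sample), 2))
```

To apply this to the full record, pass the complete Gene sequence field to `normalize_sequence` and compare the result of `gc_fraction` with the GC content field.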