Gene Hlac_2488 details

Gene Information

Locus tag: Hlac_2488
Symbol:
ID: 7401540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2465762
End bp: 2467177
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 71%
IMG OID: 643709560
Product: glycoside hydrolase family 68
Protein accession: YP_002567131
Protein GI: 222480894
COG category:
COG ID:
TIGRFAM ID:
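
The listed gene and protein lengths follow from the start and end coordinates. The sketch below is a minimal consistency check, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2465762, 2467177)
print(length)                  # 1416
print(protein_length(length))  # 471
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.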


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.266603
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.590215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGAGA CGCCGGAGAG CGACGGCGGT CGATCGCGCT CGCGGTGGAC TAGAGAGCAG 
GCCGCGTCGA TCGAGCGCCG CCGCGGGAAC ATCGCCCCGC CGGCGGGTGC CCCCGAGCTC
GACCCCTTCC CGGATCTCCA CGTCTGGGAC ACGTGGCTGC TCCGCGACCG ACACGGCGAG
ATTGCCGACG TGAACGGGTA CCGACTGGCG TTCTCGCTGA CTGCCCCGGC TGACCTGCTC
CCCGGAAAAC GCCACGATGT GGCCGAGATC CGGTGTTTCT ACTCCGCTGA CGGGAAGCGA
TGGCACGACG CCGGACCGGT CTTCGACGGC GGCGCGCTCG GCCAGCGCCA GTGGGCCGGC
TCCGCGCTGT ACGACGACGG CGAGGTCTAC CTCTACTACA CCGCCGCCGG CGACGAGGCC
GCCGACGAGA TGACGTACAC TCAGCGGATC GCGGTCGCGC ACGGCGGAAC GGCGAGCGCC
GACGAGGACG GGATCGAACT CTCCGGTCCG TGGACTCACG AGACGCTACT GACGCCGGAC
GGCGAGTGGT ACGAGACCGA GGCACAGTCG CGCGGGATGA CCTACACCTT CCGGGACCCG
TGGTTCTTCG AGGACTCGGC GACCGGCGAG ACGCACCTGC TGTTCGAGGC GAACGCGCCC
GCGCCGGAGC GACCGGGCGA CGACGAGGCG ACCGCCCACC GCCGGGAGTT CAACGGCTGC
GTCGGCGTCG CCGTCTCGGA GTCGGGCGAC CCGCTCTCGT GGGAGCTTCG CCCGCCCCTG
CTGGACGCGG TCGAAGTCAA TCAGGAACTG GAGCGCCCGC ACGTCGTCGT CGCCGACGGG
CGCTACTACC TGTTCGTCTG CAGCCACGTC CACACGTTCG CGCCGGGCGT GACCGGGCCG
GACGGGCTCT ACGGCTTCGT CGCCGACGCG CTCGACGGCG AGTACCGCCC CCTGAACGGC
TCCGGGCTCG TCGCCACGAA CCCGCCCGAA GCGCCGTTTC AGGCGTACTC GTGGATGGCG
TTCGCCCACG ACGAGGAGGT GCTCGTCCAG AGCTTCCTCA ACTACTACGA CTTCGCGGGC
GACTCGCTCG ACGCGATCGC CGACCTCCCC GAGGCCGAGC AGCGCGAGCG GTTCGGCGGG
ACGCTCGCCC CGACCCTCCG ACTCGCGCTC GACGGCGACA GCACCCGGCT GCGCGGAACG
CTCGACGCGT GGCGTATTCC CACCCCGGAC GAGCCGCTAC CGCCGGCCGA CGATTCGGAA
CTCCCGGGCG GCGACGCGCT CAGCGGTCGG CTCCGAGAAG GCGGGAGCGG CGGCGGGTAC
GCCGGGGGAC CGAGCGGCGC GGTCGACAGC GAAGGCGATA ACTCGCTCGG CGACGCCGAA
ACGGGAGACG ACGGCGGCTC TCACGGCGCT ATTTAA
 
Protein sequence
MHETPESDGG RSRSRWTREQ AASIERRRGN IAPPAGAPEL DPFPDLHVWD TWLLRDRHGE 
IADVNGYRLA FSLTAPADLL PGKRHDVAEI RCFYSADGKR WHDAGPVFDG GALGQRQWAG
SALYDDGEVY LYYTAAGDEA ADEMTYTQRI AVAHGGTASA DEDGIELSGP WTHETLLTPD
GEWYETEAQS RGMTYTFRDP WFFEDSATGE THLLFEANAP APERPGDDEA TAHRREFNGC
VGVAVSESGD PLSWELRPPL LDAVEVNQEL ERPHVVVADG RYYLFVCSHV HTFAPGVTGP
DGLYGFVADA LDGEYRPLNG SGLVATNPPE APFQAYSWMA FAHDEEVLVQ SFLNYYDFAG
DSLDAIADLP EAEQRERFGG TLAPTLRLAL DGDSTRLRGT LDAWRIPTPD EPLPPADDSE
LPGGDALSGR LREGGSGGGY AGGPSGAVDS EGDNSLGDAE TGDDGGSHGA I
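
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a rough sketch in plain Python, not the method used to generate this record. It assumes the sequence is pasted in as a string with the 10-character groups and line breaks intact, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCACGAGA CGCCGGAGAG CGACGGCGGT"))  # MHETPESDGG
```

The first ten codons translate to MHETPESDGG, which matches the start of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.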