Gene Hlac_0634 details

Gene Information

Locus tag: Hlac_0634
Symbol:
ID: 7401769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 654841
End bp: 655872
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 66%
IMG OID: 643707700
Product: glutathione S-transferase
Protein accession: YP_002565306
Protein GI: 222479069
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0435] Predicted glutathione S-transferase
TIGRFAM ID:
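
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 654841
END_BP = 655872
GENE_LENGTH_BP = 1032
PROTEIN_LENGTH_AA = 343


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1032
print(expected_protein_length(GENE_LENGTH_BP))  # 343
```

Both results agree with the Gene Length and Protein Length fields listed for Hlac_0634.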


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.240948
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGC TCGTCGACGG CGAGTGGCGG ACCGACACGT ACGAGGCGAC CGACGGGGAC 
GGCTCGTTCC AGCGCGGCGA GACGACCTTT CGGAACTGGA TCGCCGGCAG CGACGTGCCG
GATCACATCG ACGCCGAGCC GGACGACCGA TTCCAACCCG AGGCGGGCCG GTACCACCTG
TACGTCTCGT ACGCCTGCCC GTGGGCCCAC CGGACGCTAC TCGCGCGATC GCTGCTCGGA
TTGGAAGACG CGATCGACGT CTCGGTCGTC GACCCGTACC GCGGCGAGGG GGGCTGGCAG
TTCACTCCCG AGAAGGAGGG CTGTACGCCC GACAAGCTCC ACGGCAGCGA CTATCTCCGA
GAGCTGTACA TCGAGGCCGA CCCCGACGCC ACCTGCCGCG TGACCGTCCC CGTGCTGTGG
GACACCGAGG AGGGGACGGT CGTCAACAAC GAGTCGCGCG AGATCCTTCG CATACTCTCG
ACTGCGTTCG ACGACCTCGG CAACGACGCG TCGCTGCTGC CCACCCCCGA CGACGAGGCG
ACCGTCGCGG ATGTCGACGA GGTGATCACC GAAATCTACG AACCCGTTAA CAACGGCGTC
TACCGCGCCG GGTTCGCAAC CTCGCAGGCG GCCTACGACG AGGCGATCGA CGAGCTGTTC
GGCGCGCTCG ATCGCTGGAA CGACCACCTC GCCGACCAGC GTTACCTCGT CGGCGACTCG
CTGACCGAGG CCGACATTTG CATGTTCACG ACGCTGGTCC GGTTCGACCA GGTGTACCAC
ACCCACTTCA TGTGTAACAA GAAGTTCGTC CACCAGTACG AGCACCTGTG GCCGTACCTG
CGGGATCTCT ACCAGACCGA GGGCGTCGCC GAGACGGTGA ACATGGCGCA CATCAAGGAG
CACTACTACA CGACGCACCC GGACGTGACC CCAACCGGCA TCATCGCGCG CGGTCCGGAC
CTCGACTGGG AGGCGGCGCA CGATCGCGAC CGGCTCGACG ACGAGGCGCC GACGCCGAAC
GCGGCTGACT GA
 
Protein sequence
MNQLVDGEWR TDTYEATDGD GSFQRGETTF RNWIAGSDVP DHIDAEPDDR FQPEAGRYHL 
YVSYACPWAH RTLLARSLLG LEDAIDVSVV DPYRGEGGWQ FTPEKEGCTP DKLHGSDYLR
ELYIEADPDA TCRVTVPVLW DTEEGTVVNN ESREILRILS TAFDDLGNDA SLLPTPDDEA
TVADVDEVIT EIYEPVNNGV YRAGFATSQA AYDEAIDELF GALDRWNDHL ADQRYLVGDS
LTEADICMFT TLVRFDQVYH THFMCNKKFV HQYEHLWPYL RDLYQTEGVA ETVNMAHIKE
HYYTTHPDVT PTGIIARGPD LDWEAAHDRD RLDDEAPTPN AAD
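
The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a sketch under stated assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the sequence is given in blocks of ten bases as shown above, and translation table 11 is assumed to use the same codon-to-amino-acid assignments as the standard genetic code.

```python
from itertools import product

# Codon assignments in TCAG order; assumed to apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence in this record.
print(translate("ATGAACCAGC TCGTCGACGG CGAGTGGCGG"))  # MNQLVDGEWR
print(translate("GCGGCTGACT GA"))                     # AAD
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.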