Gene Hlac_0632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0632 
Symbol 
ID7401767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp651759 
End bp653063 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content66% 
IMG OID643707698 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002565304 
Protein GI222479067 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.708501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGA AACGCAGAAC GCTTCTGAAG ACGATGGGGG GATCGACCGC GCTCGCTGCG 
CTCGCGGGTT GTATCAGCAC CGGCGGCGAC GGCGGCGACG GCGGCGACGG CTCCGACGGG
TCAGACGGTT CCAATGGTTC GGACGGTTCC GACGGCTCCG ACGGTTCCGA CAGCGGAACG
ACGGGCACCA CGACGCTGTG GGCCGACCTC TCCGCCGCCG AGGACGAGGC CATGTCCGGC
TACATCGACG AGTACGAGTC GGATTCGGGC GACACCATCA ACAAGGAGGC GCCCGGCGGA
GAACTCGACC AGCAGCTCGA GACGGCGATT CCGGCCGGCG ACGGCCCCGA ATCGTGGATC
TGGGCGCACG ACTGGGTCGG CCGGTTTGCA GTCCGCGAGG AACCGCCGTT CCTGTACGAC
GCGAGTGACG ATGTCGACGT CTCGCTCGAC AGCTACACCG AGACCGCCCG GCAGGCCGCC
CAGTTCGACG GCGCCCTCCA CGGGCTCCCG TTCGCCTCCG AGACCGTCGC GCTGTTCTAC
AACGAGGACA TGGTCGACGA GCCGCCGGAG ACGATGGAAG AGATGGTCTC GATCATGGAC
GACCACCACG ACCCGGCCAA CGGGCAGTAC GGGCTCTCGT ACCCCGTGAC GGACCCCTAC
TTCGTCAGCG GGTTCATCCA GGCGTACGGC GGGGACATCT TCGATGAGGA GAACCTCAAG
GTGACCGTCG ACAGCGACGC GTGTAAGCAG GGCATAGACG CCCTCGAGAC GCTGTCCGAC
TACGTTCCGT CCGACCCCGG CTACGAGTCG CAGATCGTCG CGTTCGCGGA CGGGCTCGCG
CCGTTCGCGA TCAACGGCCC GTGGGAACTC GGCAACCTTC AGGACGAAAT CGACAACCTC
GGCGTCACGA CGCTGCCGAC CGTCGACGGG AACAACCCGC GTACGTACTC CGGGATCCAG
CTGTTCTACT TCAGCTCGAT GCTGGCGGAC GCCGACCAAT CGACGGTCGA CGCCACGACC
GGGCTCGCCG AGTGGTACAC GACCAACGAG GACATCGTCC TGAGCAACGC CGACGAACAG
GGGTATATCC CCGTCCTCAC GAACGTCGTC GACAACGACG ACCTCTCCAG CGAGGTTCAG
GCGTTCGCCC AACAGGTCGA TCACGGTGTC CCCATCCCGA CACACCCCGA CATGGACAGC
GTCTGGACGC CCGTAACGGA CGCGTTAGAG CGCGTCTTCA ACGACGAGCA GGACAGCGAC
GCGGCGCTCG ACCAGGCCGC CTCCGAGATC CGGGAGGCGC TGTAG
 
Protein sequence
MNEKRRTLLK TMGGSTALAA LAGCISTGGD GGDGGDGSDG SDGSNGSDGS DGSDGSDSGT 
TGTTTLWADL SAAEDEAMSG YIDEYESDSG DTINKEAPGG ELDQQLETAI PAGDGPESWI
WAHDWVGRFA VREEPPFLYD ASDDVDVSLD SYTETARQAA QFDGALHGLP FASETVALFY
NEDMVDEPPE TMEEMVSIMD DHHDPANGQY GLSYPVTDPY FVSGFIQAYG GDIFDEENLK
VTVDSDACKQ GIDALETLSD YVPSDPGYES QIVAFADGLA PFAINGPWEL GNLQDEIDNL
GVTTLPTVDG NNPRTYSGIQ LFYFSSMLAD ADQSTVDATT GLAEWYTTNE DIVLSNADEQ
GYIPVLTNVV DNDDLSSEVQ AFAQQVDHGV PIPTHPDMDS VWTPVTDALE RVFNDEQDSD
AALDQAASEI REAL