Gene Hlac_0244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0244 
Symbol 
ID7401170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp263745 
End bp265472 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content66% 
IMG OID643707307 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002564919 
Protein GI222478682 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.794885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.536815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCCC GCGACGAGAG CGTTTCACGG AGAAAGTTCC TCGGTGCCGC CGGTGGCGCT 
GCAGTAACGG TCGGTCTCGC CGGCTGTTCC GACAACGACG GCGAGGATTC CGACGGTTCC
GACGGCTCGG ACGGCTCCGA TGGTTCGAAC GGTTCGGACG GCTCCGACGG TTCGGACGGC
GGCGACGACA CCAGCCTCCT CCGGTACGGC CGGGGGAGTC ACTCGGCGAC GCTGGACTTC
CAGAACAGCA CGAGCGGCGA GGTTGCGAAG GTGACCGAGC AGATCTACGA CACGCTCATC
AACTTCGAGC CGGGCGAGTC GACGCTCACC GACGGGCTCG CCTCGGACTA CTCGCTCGAC
GGCGAGACGG CATCGCTCAC GCTGAAGGAG GGGGTCACCT TCCACAACGG CGAGGAGTTC
ACCGCACAGG ACTTCGAGGC GACGTACCGC CGCTTCGTCG ACTCGGAGTA CGAGTACTAC
GCTGGCGACG ACTACGTCTC CGCGTACGGT CCCTTCACGC TCGGCAACTG GATCGACGAG
ATTCAGGTCG ACGGCGACTA CGAGATGACG ATCCAGCTCA CGCAGACGTA CGCGCCGTTC
CTGCGTAACC TGGCGATGTT CGCGGCCGCT GTCCACTCCG AGGCCGCCAT CGAGGAGTAC
GGCACCGACC TGTCCGAAAA CGCGGTCGGA ACGGGGCCGT TCGAGCTCAA CACCCTCGAC
GACTCCAACG AGCAGATCCG ACTCGACGCG TACGACGACT ACTGGGGCGA CGGCCCGCAG
GTCGACGAAG CCGTTTTCGT CACGGTCGGC GAGAACTCCA CCCGAGCGCA GTCGCTCGCG
AGCGGAGAAC TCGACATTAT CGACGGGCTC GGCGCGCAGT CCTCCCAGCA GGTCGAAAGC
GCCGACAGCG CCGAACTGGT CCGCACCGAG GGGATCAACA TCGGCTACAT GGCGTTCAAC
ATCGCGGCGG TCGAGGAGTT CCAGGACCGC CGCGTCCGTC AGGCCGTCAG CCACGCGATC
AACACCGAGG CGATCGTCAA CCAGATCTAC GCCGGCTTCG CGACGGAGGC CAGCCAGCCG
CTGCCGCCGA ACGTGCTGGG CCACAACGAC GACATCGAGC CGTACCCGTA CGACCCCGAG
CAGGCACAGA GCCTGCTGGA GGAAGCCGGC TACGGCGACG GGTTCTCCTT CGAACTGGCG
ACGTTCCAGA ACCCCCGCGG ATACAACCCC TCGCCGCTCC AGACGGCCGA GACGGTCGCC
TCCAACCTCG GCGAGGTCGG CATCGAGGTC GAGATCAACC AGCAGTCGTT CGCGCCGTTC
CTTGAGTACA CGGCTCAGGG CCGCCACGAC GCCTGCTTCC TCGGCTGGTA CACCGACAAC
GCGGACCCGG ACAACTTCGC GTACGTACTC TTACACCCGC AGGTTGAGGA GAGCGAACTC
ACCGAGGGCC AGGACTGGGT GAGCTTCGAT ACCGAGGGGT ACAACACGAG TAACCGCTCG
GCGTGGGCGA ACCAGGAATA CATGGACCTC GTCGAGGAAG GTCAGCAGAC GACCACAGAG
AGCGACCGCG CGGAGCTCTA CAACGAGGCG ATGCAGATCG CCCACGACGA GGCGCCGTGG
GTGTACCTGG ACTACGCCGA GGAGCTGCGG GGCGTCGCCA ACCGGGTCAA CGGGTTCCAG
ATCGCCGCGA TCAGCGGCCC GTACCTGAAC CTGGTCTCGC TGGAGTAG
 
Protein sequence
MSSRDESVSR RKFLGAAGGA AVTVGLAGCS DNDGEDSDGS DGSDGSDGSN GSDGSDGSDG 
GDDTSLLRYG RGSHSATLDF QNSTSGEVAK VTEQIYDTLI NFEPGESTLT DGLASDYSLD
GETASLTLKE GVTFHNGEEF TAQDFEATYR RFVDSEYEYY AGDDYVSAYG PFTLGNWIDE
IQVDGDYEMT IQLTQTYAPF LRNLAMFAAA VHSEAAIEEY GTDLSENAVG TGPFELNTLD
DSNEQIRLDA YDDYWGDGPQ VDEAVFVTVG ENSTRAQSLA SGELDIIDGL GAQSSQQVES
ADSAELVRTE GINIGYMAFN IAAVEEFQDR RVRQAVSHAI NTEAIVNQIY AGFATEASQP
LPPNVLGHND DIEPYPYDPE QAQSLLEEAG YGDGFSFELA TFQNPRGYNP SPLQTAETVA
SNLGEVGIEV EINQQSFAPF LEYTAQGRHD ACFLGWYTDN ADPDNFAYVL LHPQVEESEL
TEGQDWVSFD TEGYNTSNRS AWANQEYMDL VEEGQQTTTE SDRAELYNEA MQIAHDEAPW
VYLDYAEELR GVANRVNGFQ IAAISGPYLN LVSLE