Gene Hlac_2482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2482 
Symbol 
ID7401534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2459733 
End bp2460935 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content66% 
IMG OID643709554 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002567125 
Protein GI222480888 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0758562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGACA GCGAATCCGA TCGCGAGAGC GGACGTGTGG GGGTATCGCG GCGACAGTTC 
TTGGAGGTGA CGGGCGCAAC GGGGGCAGCC GTCGGGCTCG CCGGCTGTTC CGGCGGTGGC
GGCGGCGGCG ACGGTCCGAT CCAGATCACG ATGGATGCCG AGTGGGGAGG CATCTCTGAC
GCCCTCACTC AGAGCCTGTA CGACGCGGGT CTCGACGAGT CGATCGAAAT CGAGATCCTG
CCGGGCGACT TCGAGTCGGG GGCGCGCCGG TCGGAGTTCA CGTCGGCGCT CGACGCCGGG
CGGGCGAGCC CGGACATCTT CATGATGGAC TCAGGGTGGA CGATCCCGTT TATCGCGCGC
GGTCAGCTCG TGAACCTGAG CGACGAGCTC TCCTCCGAGA CGCTCGATTA CGTCCAGAAC
GACTACCTGC CGAGCGCGGT AAACACCGCG AGCGACCCAG AGAGCGGCGA CCTGTTCGGA
CTGCCGCTGT TCCCGGACTA CCCGGTGATG CACTATCGAA AGGACCTGGT CGAGGACGCC
GGCTACGACC CGGACGGCGA GAACTGGGCG ACCGAGCCGA TGAGCTGGCA GGAGTTCGCC
GAGATGGCCG CCGACGTGTG GGAGCAGAAC GGCGGCCCCG GTGGCGACTT CGATTACGGA
TTCACGACTC AGGGCGACAA CTACGTCGGG CTCGCCTGCT GTACGTTCAA CGAGACGATG
ACTTCCTTCG GTGGCGCGTA CTTCGGCGAC CACGAGAACC TCTTCGGCCC GATCGGCGAT
CGGCCGATCA CGGTCAACGA GGAGCCCGTT CACGACACGA TCCGCATGAT GCGGTCGTTC
ATGGAGGGGC CCGACGCCGA GTACGCTCAC CCGGACTTCC CGCAGATTTC GACGACAGAT
CTGCTCTCGT TCACCGAGGA GCCGTCCCGT GAGCCGTTCA CGTCCGGGAA CGCGATTTTC
CACCGGAACT GGCCGTACGC GATCCCGCTC AACCTCGACT CCGAGGAGTT CAGCGCGGAG
GATTACGACG TGATGCCGCT TCCGTACGGC ATCGAGGCAG GCGAGGGCGA GTACGAGGGC
ACCGGCGGCG CCGCGGCGCC GCGGCGGCGC TCGGCGGCTG GCACCTCACG ATCAACCCGA
ACACCCCGCG GCTCGACGAC TGCGTTCAGG TGCTCGAGGC GTTCGCCAAC GAGGAGGTCA
TGA
 
Protein sequence
MVDSESDRES GRVGVSRRQF LEVTGATGAA VGLAGCSGGG GGGDGPIQIT MDAEWGGISD 
ALTQSLYDAG LDESIEIEIL PGDFESGARR SEFTSALDAG RASPDIFMMD SGWTIPFIAR
GQLVNLSDEL SSETLDYVQN DYLPSAVNTA SDPESGDLFG LPLFPDYPVM HYRKDLVEDA
GYDPDGENWA TEPMSWQEFA EMAADVWEQN GGPGGDFDYG FTTQGDNYVG LACCTFNETM
TSFGGAYFGD HENLFGPIGD RPITVNEEPV HDTIRMMRSF MEGPDAEYAH PDFPQISTTD
LLSFTEEPSR EPFTSGNAIF HRNWPYAIPL NLDSEEFSAE DYDVMPLPYG IEAGEGEYEG
TGGAAAPRRR SAAGTSRSTR TPRGSTTAFR CSRRSPTRRS