Gene Hlac_3666 details


Gene Information

Locus tag: Hlac_3666
Symbol:
ID: 7402457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 429644
End bp: 431335
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 57%
IMG OID: 643710197
Product: extracellular solute-binding protein family 5
Protein accession: YP_002567763
Protein GI: 222481527
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
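
The coordinate and length fields in this record agree with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(429644, 431335)
print(length_bp)                  # 1692
print(protein_length(length_bp))  # 563
```

Both results match the Gene Length and Protein Length fields.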


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACA CCACCGAGCC AACCGAAGAG GTAGCAGGCG AGGTTTCGGT CACGGACCTC 
CTCACGAACA AATCCCGGCG TCGGTTCCTC TCGGGTCTCG GTGCTGTCGG GGCGGCCGGT
CTCGCCGGTT GTACTGGTTC AGGAGGCTCT GAAGACACGA CCACTAGCTC CGACGGCACA
ACCAGCAGCT CCGACTCAAC GCTCACTGCA AACGTCTCAC AGCGGATCGG CACAATCGAC
CCCGCGAAAG GGACCGATTA CGTGCAAGCG ATGGTGCTTG TGAACCTCTA CGATCCGCTC
GTGTTTCCCA ACAGCGAGGG TGAAATTCAA CCGAATCTCG CCTCCGATTG GACCGTTTCG
GAGGACAGCA CGACCTACAC GTTCACCCTG CGTGAGGATG TGACGTTCCA CAGCGGAAAC
TCGTTTACTG CCGAGGACGT AAAATTCTCC ACGGAGCGTT TTATCGATCT CGACCAGGGG
TACGCCTCGC TGTTGAGCGG CGTCCTCGAC AAAGAAAACA TTACTGTCGA AGACGAGCAG
ACGGTCACCT TCGAGTTGAA CCGGTCGTAC GCGCCGTTCT TGCCCATCAT GGTTCTCGTG
TTCATGGTCG ACAAGGCGAC GATCATGGAC AATTTAGAAG ACGGCGAGTA CGGCGACCGT
GGTGACTACG GGCAAGCGTA CATCAACAAC AACGACGCCG GATCCGGTGC ATACCAACTC
GAAGACTTCT CGCGGGGGAA TTCCATCACC TTCGCCGCCT TCGACGACTA CTTTGGCGAG
TTCCCCGACG GCTCCTTCGA TACGGTCGAA GTCCAGATCA TTACTGAAAA TTCGACGGTT
CGAACGCTGA TGCGAAACGG CGATCTGGAT ATGAGTGGTC AGTACCAGAA CTCCCAGACG
TACCAAGCCA TCGACGAATC GGACAACGCA CGTGTTGAAG AAATACCGAC GTTCGGCCTA
CTGTACAACA AAATCAACAC CCAGAAAGCT CCGACGGATG ACCGCGCCGT GCGCGAGGCG
ATCGCGTGGG GATTCGACTA CGAGCAGGTT GTCAATACGA TTCGGCCGAA AATGAATCGC
GCACAGGGAC CGCTGCCGCC GACGTGGGGC GAACACGACG AAGACGTTTT GCAACCGTCC
TACGACCCGG ATCGAGCGAG ACGGGTCCTC GAAGACGCTG GCTACTCGGA GGGCGAACTC
ACCATCACGA ACACGTACAC CGAGTCATAC GCTTTCCAAG AGCAGATTGC CCTGCTCTTC
CAGGATAACA TGGCAGATAT CGGGATTAAC GTTGAGCTGA ATCCTCAGAC GTGGGGGACG
ATCACAGAGT TGGCTACGTC ACCGGAAGAC ACCCCACATA CGAGCCAAGT GTTCTACGTT
CCGACCTATC CCTCGCCGGA TTCGATGTTC TACAACCAGT TCCACTCGGA GGCCGCAAAC
ACGTGGATGA GCATGGAACA TCTCGACAAC GACGAGGTCG ACGCCCTCAT CGATGAGGCA
CGCCAGACGC CGGACCCGGA AGCCCGTGCC GAGATCTACC GGGAACTGCA GAATACCCTA
GCTGATCTCT ACTGTGATAT GCACCTCTAC CACACGGTCA AAACAATCGG CTTCCAGAAC
GACGTTGAGG GGCTCACCCT CCGGCCAGCA CAAGGCTTCG AATACACGTT CCGGGATCTC
CACCAAGTCT AA
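
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. As a rough sketch of how the GC content field could be recomputed, the example below strips the layout and counts G and C. The helper names are hypothetical, and only the first displayed line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the display spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence from this record.
first_line = clean_sequence(
    "ATGAACGACA CCACCGAGCC AACCGAAGAG GTAGCAGGCG AGGTTTCGGT CACGGACCTC"
)
print(len(first_line))          # 60
print(gc_fraction(first_line))  # 0.6
```

Passing the complete gene sequence to the same two functions gives a value that can be compared with the GC content field of the record.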
 
Protein sequence
MNDTTEPTEE VAGEVSVTDL LTNKSRRRFL SGLGAVGAAG LAGCTGSGGS EDTTTSSDGT 
TSSSDSTLTA NVSQRIGTID PAKGTDYVQA MVLVNLYDPL VFPNSEGEIQ PNLASDWTVS
EDSTTYTFTL REDVTFHSGN SFTAEDVKFS TERFIDLDQG YASLLSGVLD KENITVEDEQ
TVTFELNRSY APFLPIMVLV FMVDKATIMD NLEDGEYGDR GDYGQAYINN NDAGSGAYQL
EDFSRGNSIT FAAFDDYFGE FPDGSFDTVE VQIITENSTV RTLMRNGDLD MSGQYQNSQT
YQAIDESDNA RVEEIPTFGL LYNKINTQKA PTDDRAVREA IAWGFDYEQV VNTIRPKMNR
AQGPLPPTWG EHDEDVLQPS YDPDRARRVL EDAGYSEGEL TITNTYTESY AFQEQIALLF
QDNMADIGIN VELNPQTWGT ITELATSPED TPHTSQVFYV PTYPSPDSMF YNQFHSEAAN
TWMSMEHLDN DEVDALIDEA RQTPDPEARA EIYRELQNTL ADLYCDMHLY HTVKTIGFQN
DVEGLTLRPA QGFEYTFRDL HQV
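
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. Table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons. The sketch below builds the codon table from the conventional T, C, A, G ordering and translates two short stretches of this gene. It is a minimal illustration with hypothetical names, and it does not handle alternative start codons (not needed here, since this gene begins with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, '*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a DNA sequence after removing display spaces and line breaks."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAACGACA CCACCGAGCC AACCGAAGAG"))  # MNDTTEPTEE
print(translate("CACCAAGTCT AA"))                     # HQV*
```

The two outputs match the start (MNDTTEPTEE) and end (HQV) of the protein sequence listed above, with `*` marking the TAA stop codon.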