Gene Hlac_0252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0252 
Symbol 
ID7401178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp272588 
End bp274228 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content66% 
IMG OID643707315 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002564927 
Protein GI222478690 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGCG ATCACTCGAA CGTGAACCGC CGGACGTACC TCACGTACGT CGGAGGAACT 
GCGGCCACCG TCGGGTTGGC CGGCTGTTCG GACAACGGCG GCTCGGGCGA GGACAACGAA
AACGGCAGTG ACGGCAGCGA CGGCAGCGAC GGCAGCGATG GCGGGGACGA AGAGCAGCTT
CCGGAGCCGG AGACCCGCGA AGAGCACCTC CAGCGCGCGA ACCTCCGGCT CAACCAGCGC
GCCCCGTGGA TCTTCCTCAA CCGCCAGTAC AGCGTGTACG GCATCGCGAG CCGACTCGGC
TGGGACGCTC GCCGCGACGA GCGGATCGAA GCGCAAGCCA TCTCGGTGAC GGAGGGCGAG
CCGTCGGTCG CGATCACCCA GTCGTCGATG GATTCGGGGC TCGACCCCCA CGACCACCGT
GAGACGCCGA CCGACAACAT CGTCGTGCAG GCCTACGACG GCGTGCTCGG TCGCAACGCC
GACGGCGACA TCATCGACGC GCTCGCGACG GACTACGAGC GGCTCGAAGA CGGTCGCGTC
CGATTCGAGA TCCGCGACGG CGTCACCTTC CACAACGGCG ACGAGCTCCA GCCGTCGGAC
ATCGCCTACA GCGTCAACCG GGTCGTCGAC CCCGAGGTCG GAATCTCGAG CCCGCAAAAC
GACCAGCTCG CCGGCGTCAC GGGCGCCGAG GTCGTCGACG GCGGGGTCGA GGTGACCTCC
GACGGGATCA ACCCGATCGT GTTCTCGCTT TTCGCCTCGT ACTGTAAGGT CGTTCAGCAG
GACTGGATCG AGTCGCGCGA CACCTCGGCG ATAAACTCCG ACATGAACGG GACCGGCCCG
TTCCAAGTCG TCGAGTACGA GCAGGACGTC GAGATCGTCT ACGAGCCCTA CGAGGGGTAC
TGGGGCGACG CGCCGGAGAT CGAAGAGCTG ACGATCCGGT CGGCCAGCGA GGCGAGCACG
CGCGTCTCGC AGCTGCTGGC CGGCGAGACG GACCTCATCG TCAACGTGCC GCCGCAGGAA
GTGAGCCGCG TCCGCGACGA GGACACCACG GAAGTCACAG CGGTGCCGAG CACTCGTGTC
GTGTTCAACG CCATGCGGTA CGACGTAGAG CCGTTCTCCA GCGTGGAGTT CCGGCAGGCG
ATGAACTACG CCATCGACTT AGACAGCATC ATCGAGAACA TCCTGCAAGG GTTCGCCGAC
GCGACCGGCC AGCCGACCCT CGAAGGGTTC GTCGGCTACA ACGAGGAGAT CGATCCGTAC
CCGCAAGACA TCGAGCAAGC CGAACAGCTC GTCGAGGACT CCGGTCACGC CGGCGCCGAG
ATCACCCTCG AAACGCCCGT GGGACGGTAC CTCCGCGACG TGGAGATCGC ACAGGCGGTC
GCTAGCCAGA TCGACGAGCT CTCGAACGTC TCCTGTGAGG TCGAGCAGCG CGACTTCGCC
TCGCTCGCGG GCGAGGTGAC GAGTGGTGAT ATCGAAAACA TGCCCCACTT CTACCTGCTC
GGCTGGGGGA ACACGACGTT CGACGCCAGT CAGACGATCA TCCCGCTCCT CACGTCCGAC
GGGGCGCTCT CCAGCTATCA GGGCGACGAC GAGGTCGACG AGCTCATGTC CGAGTCCCAG
AACCTGCCGG GCGGAAACTA A
 
Protein sequence
MSRDHSNVNR RTYLTYVGGT AATVGLAGCS DNGGSGEDNE NGSDGSDGSD GSDGGDEEQL 
PEPETREEHL QRANLRLNQR APWIFLNRQY SVYGIASRLG WDARRDERIE AQAISVTEGE
PSVAITQSSM DSGLDPHDHR ETPTDNIVVQ AYDGVLGRNA DGDIIDALAT DYERLEDGRV
RFEIRDGVTF HNGDELQPSD IAYSVNRVVD PEVGISSPQN DQLAGVTGAE VVDGGVEVTS
DGINPIVFSL FASYCKVVQQ DWIESRDTSA INSDMNGTGP FQVVEYEQDV EIVYEPYEGY
WGDAPEIEEL TIRSASEAST RVSQLLAGET DLIVNVPPQE VSRVRDEDTT EVTAVPSTRV
VFNAMRYDVE PFSSVEFRQA MNYAIDLDSI IENILQGFAD ATGQPTLEGF VGYNEEIDPY
PQDIEQAEQL VEDSGHAGAE ITLETPVGRY LRDVEIAQAV ASQIDELSNV SCEVEQRDFA
SLAGEVTSGD IENMPHFYLL GWGNTTFDAS QTIIPLLTSD GALSSYQGDD EVDELMSESQ
NLPGGN