Gene Hlac_0133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0133 
Symbol 
ID7401654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp140749 
End bp142743 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content68% 
IMG OID643707197 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002564809 
Protein GI222478572 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGGC AGATCAGTCG GCGCGGCGCG CTCGCGGGAG GGTTCGCGGC CCTGAGCGCC 
GGCTGTCTCG GCCGCACACG GAACATCGCG GGACGCGACC GGTCCTCGCA GCTAACTCTT
GAGATCAACG CCCCGCCCGC CGACAGGGAC CCCAACGCGA TTCGGATCGC CAGACACCTC
GCGGAGAACC TGAACGCAGT GGGCATCGAC GCCCGGATCA GCACCCTCGG GCACACCGAC
CTCCGGCGGA AGGTGCTCAT CAACCACAAC TTCGACGTGT ACGTCGGGCA GTTCCGGGAG
GCGGAGCCGT TCGACCCGGA CGCGATGTAC GCGTTCACTC ACTCCCAGTT CGTCGCGGAG
TCGGGGTGGC AGAACCCGTT CGGATTCACC GACATCAACG GCGTCGACGA GTTGCTCGCG
ACCCAGCGCC GAGCGGACGG CGACGAACGA CGCGAGGCCG TGGCGGAGCT CCAGCGAACC
CTCGGTGAGC TGCAGCCGTT CACCGTCGTT GCGTTCCCCG ACCCGCTCAT CGCGGTCCGC
GAGGACCGCT TCGAGAACTG GACGAACCAT CAGCCGCTGT CAGTCGGTGG GTTGCTCGGC
TTGGAGCGCT CCGCCGCGGC GGACGGGGAC GCCGACGGGA GCGGGGCCAA GATCGAGGAC
GGCGGGACGG AAGCCGGGAA CGGGACTGCC GACGGCAACG AGACCGCCGA CGGCGACGAG
ATCGATGGTA ATGAGACCGC TGTCAACGAG ACCGACGACG GGCTGATCGA CGACAACCCG
CTCACGGACG ACGACGATGG TGCGGACGAT GCCGCCACCC TCCGGCTCGT GACGACCGAC
GAGCGCATCA CGCAGAACTG GAACCCGATC GCCGCCGAGT ACCGGCGCTA CGGCACGTTC
ACCTCCCTGC TGTACGACCG ACTCGTGCTG GTCGACGACG GGGAGGTGAT CCCGTGGCTC
GCCGCCGACT GGGAACAGGT CGGCGACGCG GGGGTCAAGA TCTCGCTGCG AGAGGCCGAC
TGGCACGACG GGGAACCGGT GACGGCCGAA GACGTCGCGT TCACCTACGA GTTCCTCCAA
GACACCTCGC TGGGAACGAC CGAGTCGCCC GTGCCGACAC CGACCTTCCG CGGTCGGGTG
TCCGCCGTCG AGACGGCGAC CGCCATCGAC GAGACGACGG TCCGGCTGAC GCTCGACGGC
GTCAACGACG CGGTCGGCAT GCGCGCGCTC CAGGTACCGA TCCTCCCGAA GCACGTTTGG
GGGGAGCGGA CCGATATGGC GACGATCGCC GGATTCGAGT TCGACGCGGA GACGACCGAG
GCCGTGGTGA CGAACAACGA GAATCCGATC GGGAGCGGTC CGGTGCGCTT CGTCGAGGCC
ACTCCCGAAG AGTCGGTCGT CTTCGAGCGC AACCCGGACC ACTTTCTCGT GCGTGCGGCA
GACGGGGGGG AATCGGCCGG TGACGCGACG GACCCGCTCA CGGAGATCTC CGAGCGATTT
CACGGGAAGC CGGCGTTCGG CCGCCTTGAG ATCGAGGTAA TGGGGTCAGA CATCGCCGCG
GTGCAGGCGG TCGGAGACGG CTTCGCGGAC GCGACAGTCT CGAACCTCGG CCCGGACTCT
GTCCCGCGGA TCGGGCGCGA AGCCGACGCT CGGCTCGTGA CAGGGCGATC CGGCGGGTTC
TACCACATCG GGTACAACAC TCGCCGGGCA CCGCTGTCGA ACCCCCGATT CCGCAGGGTC
CTCGCGTCGC TGATCGACAA GCAGACCCTC GTCGACGTTG CGTTCGACGG GTACGCCGAA
CCCGCGGCTT CACCGCTCGC CGCCACCCCG GAGTGGGTGC CCTCGGACCT CCGCTGGGAG
GACCGCGAGA CGGACCCAGT CCATCCGTTT GTCGGCGAGT CGGGATCGCT CGACTCCGAG
ACGGCCCGTA ATCGACTCCG CGAGGTGGGG TACCGGTTCG ACGAGGAGGG ACGGCTGCTC
GCACCGAACA CATGA
 
Protein sequence
MTRQISRRGA LAGGFAALSA GCLGRTRNIA GRDRSSQLTL EINAPPADRD PNAIRIARHL 
AENLNAVGID ARISTLGHTD LRRKVLINHN FDVYVGQFRE AEPFDPDAMY AFTHSQFVAE
SGWQNPFGFT DINGVDELLA TQRRADGDER REAVAELQRT LGELQPFTVV AFPDPLIAVR
EDRFENWTNH QPLSVGGLLG LERSAAADGD ADGSGAKIED GGTEAGNGTA DGNETADGDE
IDGNETAVNE TDDGLIDDNP LTDDDDGADD AATLRLVTTD ERITQNWNPI AAEYRRYGTF
TSLLYDRLVL VDDGEVIPWL AADWEQVGDA GVKISLREAD WHDGEPVTAE DVAFTYEFLQ
DTSLGTTESP VPTPTFRGRV SAVETATAID ETTVRLTLDG VNDAVGMRAL QVPILPKHVW
GERTDMATIA GFEFDAETTE AVVTNNENPI GSGPVRFVEA TPEESVVFER NPDHFLVRAA
DGGESAGDAT DPLTEISERF HGKPAFGRLE IEVMGSDIAA VQAVGDGFAD ATVSNLGPDS
VPRIGREADA RLVTGRSGGF YHIGYNTRRA PLSNPRFRRV LASLIDKQTL VDVAFDGYAE
PAASPLAATP EWVPSDLRWE DRETDPVHPF VGESGSLDSE TARNRLREVG YRFDEEGRLL
APNT