Gene Hlac_0069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0069 
Symbol 
ID7401424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp72570 
End bp74471 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content63% 
IMG OID643707130 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002564745 
Protein GI222478508 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.884442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA AAGACGAACT CAGGCGGCGT CAGTTCCTCC GAGTGACCGG AGCGTCGGCA 
CTCACCGCCG GCATCGCAGG TTGTTCCGGA GACAGCGGCG GCGAACCCAG CGATGGGGGC
GACGGCTCCG ACGGCGAGGA CGGATCCGAC AGCGAAGACG GATCCGACGG CGAGGACGGA
TCCGACGGCG AGGACGGATC CGACCAGTTG ATGCCGTCGT CGTACCCGTA CGGGGCTAAC
GAAAATCGGA TCAGTGAGGC TCGAGCGGTT ATGGAAGAGG CCGGATACGG CCCTGACAAC
CGATTCAGCC TCGATTGGCT CCAGTACAAC TCCCCGGCGT GGGAGGAGAT TGCGAATACG
CTTCGTGCCC GCCTCGAGTC GGTCCACGTC GATATGAACA TCAGCAAGGC CGACTTCGGC
GCCCTCCTCG AGCGGACCGA AAAGGGCGAG ATGGACGCGT TCACGCTCGG CTGGATCGCG
GATTACCCGG GTGTCAGGAA CTTCGCTCAG CTGGTCGATC CGGATAACAC GATCTACGAC
GCCGAAGGGG CTTCGCCGAA CGGTGCGCGG CTGTTCTGGA GCGAAGATTC CTACACCGAC
CCCGAAGTCC GCAGCGCGAT GTCAGAGGCG TTCGCGCAGC TTTCGGAGAA TCCGGGCAAC
AAGGACGAGG CGGAGAGCGC ACGCGCGGAG GCGACGCTTC GGTTGGAGAA GCTCCTCTGG
GAGTCCGCGG CGTTGCTCCC GGTGTATCAC AGCGTCGAAG ACGTGTTCTG GTACGACCGG
GTCGACTACA ATCCGCCGGG CGGGATGGGA GTGTCCCGGG CGAAGACCAG CACGTCCGTC
CAGGGACTCG AAGGCAGCGA CACGCTGAAG GGCACGTCCG CGACCTTCAA CGCACTCGAC
CCGATCGCGT CGGGGAACAC GGCGAGTGGT TCGAAGGTCA TGGACATATT CGACGCACCG
CTCAACTACG TCAACGGAAC GGTCGAGGTC GAGCCGCTTC TGATCGAGGA CTACACCACG
AACGACGACC TCACCGAGTA CGAGTTCACG CTCAAGCAGG GCGTCCAGTT CCACGGGGAT
TACGGCGAAC TGACGGCGGA CGATATGGTC TACTCGATCC GGCGTCTGGT CGAATCCTCG
AACTCGACGA ACACGTACTT CCCGATCAGC GTCCTGAACA TCGACCGCGA GGAGGACGAG
GATGGCAACG TCGTGCCCGG ATCGGTTGCC GTCGAGGCGA CCGGCGACTA CACCTTCAGC
GTCACGCTCC GCAATTCGTT CGGCTACGCG CTCGAAGTGC TCAGTTACTC GGCGTTCTCG
GCCGTCCCCG AGGGTATCGT GGGCGACGTC GAGGGATACG ACGGCGACAT GGATTACCAG
CAGTTCTCGA CGAACCCGGT CGGCTGTGGT CCGTACGTCT TCGAGGAGTG GAACTCCGGT
GTCGGCGGAG AGTTCCGAGC CTCGGCGTTC ACGGACTACC ACGGCGGAGA GCCGGCCGCC
GCGAACATCC AGGACGCGAT CCTCAGCGAG CCGAACGCGA TATACAACCG GTTCCTCAAC
GAGAACGCGG ACGTCAGTGC GATCCCGACC TCGCAGTTCG ATCCCGGACT GAGCGACCTG
ACGAGTCAGG ACGGCGCCCA ACAGACCGGA ACGTACGGTC CGCTCGGGAA CGACCAGACG
GTCAACATGT CGCGGACGCC GACGATCGAC ACGTTCTACA TCGCGTTCAA CATGGAGAAC
GTCCCCAAGC CGGTCCGACA GGCGATGGCG TACGTGATGA CGGGCGACGA CTTCACCGAG
AGCGTCTTCA AGGGTCGTGG CGAGTCCGCG TACCACCTCA CGCCGCCACA GATCTTCCCC
GGCGGCGGTG AGGGGTACGC CGACCACTGG CAGGGCGAAT AA
 
Protein sequence
MSDKDELRRR QFLRVTGASA LTAGIAGCSG DSGGEPSDGG DGSDGEDGSD SEDGSDGEDG 
SDGEDGSDQL MPSSYPYGAN ENRISEARAV MEEAGYGPDN RFSLDWLQYN SPAWEEIANT
LRARLESVHV DMNISKADFG ALLERTEKGE MDAFTLGWIA DYPGVRNFAQ LVDPDNTIYD
AEGASPNGAR LFWSEDSYTD PEVRSAMSEA FAQLSENPGN KDEAESARAE ATLRLEKLLW
ESAALLPVYH SVEDVFWYDR VDYNPPGGMG VSRAKTSTSV QGLEGSDTLK GTSATFNALD
PIASGNTASG SKVMDIFDAP LNYVNGTVEV EPLLIEDYTT NDDLTEYEFT LKQGVQFHGD
YGELTADDMV YSIRRLVESS NSTNTYFPIS VLNIDREEDE DGNVVPGSVA VEATGDYTFS
VTLRNSFGYA LEVLSYSAFS AVPEGIVGDV EGYDGDMDYQ QFSTNPVGCG PYVFEEWNSG
VGGEFRASAF TDYHGGEPAA ANIQDAILSE PNAIYNRFLN ENADVSAIPT SQFDPGLSDL
TSQDGAQQTG TYGPLGNDQT VNMSRTPTID TFYIAFNMEN VPKPVRQAMA YVMTGDDFTE
SVFKGRGESA YHLTPPQIFP GGGEGYADHW QGE