Gene Hlac_2647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2647 
Symbol 
ID7400852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2631567 
End bp2633240 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content71% 
IMG OID643709719 
Productprotein of unknown function DUF1508 
Protein accessionYP_002567288 
Protein GI222481051 
COG category[S] Function unknown 
COG ID[COG3422] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAACG ACAGACAATC AGGCTCTCCC GGCGGGCTGT ACGAGCAACG CATCGGGACG 
CCGACGACAG ACGACGAGGT GAACGGGTTC TGGCTGTTCG GGTTCGGTGT GCTGCTCGGA
CTGGCCGGAG TCCTCCTGTT TTTCCTCACC GAGTCGGCGA CGACCGCTCG CGGGATCGCG
TACGCGCTCG CGGCGCTGGC TCCGCCGTTC ATCATGCTCG GCGCGGTGAT CCGCTTCCCG
CTGCGGCGGG CGGGAACGTT CATCGGATAC CTCGGGACGA CGGTCAGCGT CCTCGGCGTC
GTTTGGTTCG TGAACATCTT CCCGGACGGC TGGTCTACCG CCTCCGGCGA GCCAGCCGTG
ATCGCCGTGT ACGGAATCGG CCTCACGCTC ATCGGGCTCG CGGGGACGAT CGTCCCGCTC
CTGTCCGACC CGGTCTACGA GGACTACGAG CGGATGCAGG GCGAGGCGGC GGCCGCGACG
GCCGAGCGCG ACGAGACGAG CGCGGAGTTG GCGTCGACCC GCGAGGAGTT GGATGCGACC
GAATCGGAGC TTTCGGCGAC GAACGAGGAG CTGTCGGAGA CGGAGTCGGC GCTCGAAGCG
GCCCGCGCGG AGACGGCGGC GCTGCGGGGG AGCAAGGCGC GGTTCGAGCT GTTCGAGGAC
GCGGGCGGAA AGCCGCGCTG GCGGCTCCGC CACCGCAACG GCAACGTCAT CGCGACCGCC
GGGCAGGGGT ACAGCAGCCG CGGAAAGGCA CAAAAGGGCC TCCACAGTGT CAGGCGGAAC
GCGCTCGGCG CCGGGATCCT TCGGATCGAG ACCCCGGTCG CGGAGGCCGA GGCGATCGCC
GTCGACGACG GTCCCGAGCC CGCGGACGCC GCGGACCCGA ACGTGGCCGT CCCGAGCGAC
GACGAGGCGA TCGCGAGCAA GGCGACCTTC GAGCTGTTCG AGGACGCGGG CGGGGAATGG
CGCTGGCGAC TCCGCCACGA CAACGGGAAC ATCATCTCGG ACTCCGGCGA GGGGTACGCC
TCCAAGTCGA ACGCGAAGCG CGCGCTCGGC CGGGTCCGCG AGCACGTCGC CGCCGCGGAC
TACCTCCGGG TCGACCCGAC CGCCTTCGAG GTGTTCCGGA ACGCCGGCGG CGAGTGGCGC
TGGCGGCTCA TCCACGAGAA CGGGAACGTG CTCGCCGACT CCGGCGAGGG GTACTCCTCC
CGGTCGAAGG CCCGGCAAGG GCTCGACAGC GTGCAGTCGA ACGCCGCCGA GGCGGCCCTC
GAAGCGGTCG GCGACGACGG CGTGGCCGGT GACGCCGACG GGAACCCGAA CGCGACCTTC
GAGCTGTACG AGGACGCGGC CGAGGAGTAC CGCTGGCGAC TCCGCCACCG CAACGGGAAC
ATCATCGCGG ACTCCGGCGA GGGGTACGCC TCCGAGTCGA ACGCGCGGGA CGCGATCGGG
CGCGTCCGCG AGTACGCCCC CGACGCCGAC GTGTTGGAGG TCGGCAACGC CGCCTTCGAG
ATCTACGAGG ACGCCGCCGA CGAGTGGCGC TGGCGGCTCC GCCACCGCAA CGGGAACATC
ATCGCGGACT CCGGCGAAGG GTACGCCTCC CGGTCGAACG CGGTCGAGGG GGTCACGGGC
GTGAAGGCGA ACGCCCCCGG CGCGGAGGCG GAGACGGTCG AGGCGGAGGA GTAG
 
Protein sequence
MANDRQSGSP GGLYEQRIGT PTTDDEVNGF WLFGFGVLLG LAGVLLFFLT ESATTARGIA 
YALAALAPPF IMLGAVIRFP LRRAGTFIGY LGTTVSVLGV VWFVNIFPDG WSTASGEPAV
IAVYGIGLTL IGLAGTIVPL LSDPVYEDYE RMQGEAAAAT AERDETSAEL ASTREELDAT
ESELSATNEE LSETESALEA ARAETAALRG SKARFELFED AGGKPRWRLR HRNGNVIATA
GQGYSSRGKA QKGLHSVRRN ALGAGILRIE TPVAEAEAIA VDDGPEPADA ADPNVAVPSD
DEAIASKATF ELFEDAGGEW RWRLRHDNGN IISDSGEGYA SKSNAKRALG RVREHVAAAD
YLRVDPTAFE VFRNAGGEWR WRLIHENGNV LADSGEGYSS RSKARQGLDS VQSNAAEAAL
EAVGDDGVAG DADGNPNATF ELYEDAAEEY RWRLRHRNGN IIADSGEGYA SESNARDAIG
RVREYAPDAD VLEVGNAAFE IYEDAADEWR WRLRHRNGNI IADSGEGYAS RSNAVEGVTG
VKANAPGAEA ETVEAEE