Gene Hlac_2504 details


Gene Information

Locus tag: Hlac_2504
Symbol:
ID: 7401556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2480521
End bp: 2482614
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 70%
IMG OID: 643709576
Product: Protein of unknown function DUF460
Protein accession: YP_002567147
Protein GI: 222480910
COG category: [S] Function unknown
COG ID: [COG2433] Uncharacterized conserved protein
TIGRFAM ID:
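
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short calculation. The following is a minimal sketch, assuming 1-based inclusive coordinates and a stop codon that is counted in the gene length but not in the protein length; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = feature_lengths(2480521, 2482614)
print(gene_len, protein_len)  # 2094 697
```

The result matches the Gene Length (2094 bp) and Protein Length (697 aa) fields listed for Hlac_2504.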


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.459934
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGGTATA TACGCTCCAA CGGCCTACGT GAATACGTGA CAACCCGGAC CATCCGCGCC 
CGCGACCGGC CGGTGTTCGG CGTCGACGTC CACAGCGGCG ACGTCCGCGG CGACGACCCC
TCCTACGCCC TCGTGATCTT GGATCCCGTC GACGACGACG ATCCGGACGC GCCCGACACG
GACGAGCCGA TGGCGCGGGT GACCCGCGAC GTGGTGTCGT TCCGGAAGCT CTGCCGATTG
ATCGACGACC GCAAACCGCT GTACGTCGCC ACCGACAACG CCTACGAGCT GGCGGCGAAC
AAAAACGATC TCGTGGGCTT CCTCCGATCG CTTCCCGACG GCACGCGGCT GGTGCAGGTG
ACCGGCGCGG AGCGCCCGGA GCCGCTCTCG CGGGTCGCCT CCAGACACGG GATCCCGTAC
GGGAAAGAGC CGATGAAGGA GGCTGAGGCG TCGGCCCGGC TCGCGGCCGC CAACGTCGGC
CACGAGGTGA CCGCCTTCAC CGACGAGACG CAGGTGAAGG TGTCCCGCGG GCGCTCGACC
GGGAAGGGCG GCTGGAGTCA GGACCGCTAC ACCCGGCGGA TCCACGGGAA CGTCCGGAAG
CGGACCCGAC AGGTCCAGTC GAAGCTGAAG GAGGCGAACC TCGACTTCGA GCGCGACGTG
ACCGAGAAGT ACGGCGGCTA CGCGAACGCG ACGTTCGCGG TCGAAGCGCG CCCGGAGGAC
ATCCCGGTGT CGAACTCGCG GGCGGGCGAC GTGCGCGTCG AGGTCGAGCG CGAGCGCCGC
GACGGGATCG AGTACGAGCC GCTCGTGAAG CGACGCGACC GGGTGATCGT CGGGATCGAT
CCGGGGACGA CCACCGCGGC CGCCGTGGTC GGGCTCGACG GCACCGTCCA CGCCCTCTAC
TCTTCGCGAA CGTCCGACAC GGCCGACGTG ACGGAGTGGA TCATCGAGCA GGGCCGCCCG
ATCATCGTCG CCGCCGACGT GGAGCCGATG CCGGAGACCG TCGAGAAGTT CCGGCGTTCG
TTCGACGCCG CGGGCTGGCG ACCGACCACG GACCTGCCCG TCGACGAGAA ACTCCATCGG
ACCCGCGAGA CCAACTACGA CAACGATCAC GAGCGCGACG CGCTGGCGGC CGCGCTGTAC
GCCTACGACG ACCACGAGGA CCAGTTCGAG CGCATCGCGG CGAAGACCCC GCCTCGGCTC
GACCGCGAGG CGGTGATCGC GGGCGTCGTC GCGGGCGGGT CATCCGTCGA GGCCGTCATC
GAGGGGCTGA GCGAGGACGA CGGGAGCGGC GGTGGTGACG GGGGTGGCGA CGGCGGCGCG
GAGGAGACGG ACCCCACCGA GCCCGAGCGC ACCGAAGAGG AGGAGACGAT CCGGCGGCTC
CGCGAGCGCG TCGGCCGGCT GGAGTCGCAC GCGGAGTCGC TGGAAGCGGA CCTCACAGAG
CGCGACGAGC GGATCGCGGA GCTAGAGGGC GAACTGGAGG AGGCGAGGCG GGAAGAGCGG
ATCGAGGCGC GGAGCCGGCG GGCGGTGTCG CGGCTCGAAC GCGAGACGGA CCGGCTGCAG
CGCGAGCGCG ACGAGGCCAG AGAGCGGGCG GACGAATTAG AGGGGAAAGT GGAGACGCTG
AAAGAGCTGT GGCGGCTCGA TCACTCCAAC TTCGGGGACG TGGCCGCGGA TCAGGGGCTC
GCGAGCGTGA AAGTCGTCGA GCAGTTCACG CTCGACGCGC TGGAGGCCGC AGATGAAGCG
TATGGCCTCG TCGCCGGCGA CGTGGTGTAC CTTCGGGACG CTTCCGGAGC GGGACGGCGG
ACCGCAGAGC GACTGGCGGA GACGGAACCG CGGGCAGTGA TCCGCGACGG GAACCTCTCG
GAGGTGGCCG ACCAGGTGCT CTTCGACCAC AACGTGCCGG TCCTCCCGGC CGACGCGGTC
CCGGTGCGCG AGGTCGACGA ACTGGCGGTT GCGAGCGAGG AGGCCGTGGC GGCCGCCGTC
GACGACTGGG AGCGACGAGC CGAACGCCGG CGAAAGGACG AGAAACAGGA GCACCTCGAC
CGGATCATCT CGGAGCATCG AGCGGGGCGG ACGCTACCGG AGACCGAAGA GTAG
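
The gene sequence is displayed in blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to run basic sanity checks (GC fraction, start codon, stop codon). The helper names are hypothetical, and `COMMON_STARTS` lists only three of the start codons that translation table 11 allows, which is an assumption made for brevity. The record's sequence begins with GTG, an alternative start codon that is translated as methionine (M) at the initiation position, and ends with the stop codon TAG.

```python
# Illustrative helpers (hypothetical names, not part of any library).

# A subset of the start codons allowed by translation table 11.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
# Stop codons of the standard and table 11 genetic codes.
STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Check length, start codon, and stop codon (internal stops not checked)."""
    return len(seq) % 3 == 0 and seq[:3] in COMMON_STARTS and seq[-3:] in STOPS


# The first display line of the gene sequence, joined into one string.
first_line = clean_sequence(
    "GTGGGGTATA TACGCTCCAA CGGCCTACGT GAATACGTGA CAACCCGGAC CATCCGCGCC"
)
print(len(first_line), first_line[:3])  # 60 GTG
```

Passing the full sequence text through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field of the record.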
 
Protein sequence
MGYIRSNGLR EYVTTRTIRA RDRPVFGVDV HSGDVRGDDP SYALVILDPV DDDDPDAPDT 
DEPMARVTRD VVSFRKLCRL IDDRKPLYVA TDNAYELAAN KNDLVGFLRS LPDGTRLVQV
TGAERPEPLS RVASRHGIPY GKEPMKEAEA SARLAAANVG HEVTAFTDET QVKVSRGRST
GKGGWSQDRY TRRIHGNVRK RTRQVQSKLK EANLDFERDV TEKYGGYANA TFAVEARPED
IPVSNSRAGD VRVEVERERR DGIEYEPLVK RRDRVIVGID PGTTTAAAVV GLDGTVHALY
SSRTSDTADV TEWIIEQGRP IIVAADVEPM PETVEKFRRS FDAAGWRPTT DLPVDEKLHR
TRETNYDNDH ERDALAAALY AYDDHEDQFE RIAAKTPPRL DREAVIAGVV AGGSSVEAVI
EGLSEDDGSG GGDGGGDGGA EETDPTEPER TEEEETIRRL RERVGRLESH AESLEADLTE
RDERIAELEG ELEEARREER IEARSRRAVS RLERETDRLQ RERDEARERA DELEGKVETL
KELWRLDHSN FGDVAADQGL ASVKVVEQFT LDALEAADEA YGLVAGDVVY LRDASGAGRR
TAERLAETEP RAVIRDGNLS EVADQVLFDH NVPVLPADAV PVREVDELAV ASEEAVAAAV
DDWERRAERR RKDEKQEHLD RIISEHRAGR TLPETEE