Gene Hlac_2006 details

Gene Information

Locus tag: Hlac_2006
Symbol:
ID: 7402025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2000174
End bp: 2001763
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 71%
IMG OID: 643709077
Product: polysaccharide biosynthesis protein
Protein accession: YP_002566654
Protein GI: 222480417
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID:
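
The coordinate and length fields in this record agree with each other, and the check is simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the stop codon. The record states neither convention, so both are assumptions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record.
START_BP = 2000174
END_BP = 2001763

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1590, matches "Gene Length"
print(protein_length(length_bp))  # 529, matches "Protein Length"
```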


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.300214
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.433718
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAGTC GATACGTGAA CGACGGACTC GGTGCCGGTG GCGACGGCGA GCGCGAGGCG 
GCGAGCGACG ACGACAGCGT CCCCGACGCC GAGCGCGAGG CGCTCCTGAC GATCGCGGGC
GGAGCCGTGA TCACCGCCGG CGGTGTCTCC GGCCAGCGCG CGCTCACGGC CGTGACGGAG
TTCGCGCTCG CTCGCGGACT CGGCCCGGCC GCCTACGGCG TGTACGCACT GGCGTGGCGG
ATCGCCCAGT TGCTCTCGCG GCTCGTCACG TTCGGGAGCG TGCCGGCGCT CCAGCGCTAC
CTCCCCGAGT ACGCGGACGA CCCCGACCGA CAGGGGGTCG TCGCCGGGCT CGCGTACGCC
ACGACGCTCG GATTCGGTGC CGCCATCGCC GCCGGAATCT GGGTGGCAGC ACCGCGGATC
AACGCGCTCA CCGTCGAGGC CCCCGCGTTC CCGCCGACGA TGCGCGCGTT CGGCTTCCTC
GTCGGATCGC TCGGCGTCGT GATGGTCGCG TCGGCGATCT TCCGCGCGGT CGGGTCCGCG
CGCGGCGAGA TCGCCTTCAA CAAACTCCTC CGCCCGGGCG TCCGGCTCGT GGGCGCGCTT
ACGGCGCTGG CGCTCGGCTA CTCGGTCGTC GGCGTCGCCG GCGGCATCGT CGTCGCGACC
GCGCTGCTCG CGGCCGTCGC CGCGCCCCTC TCCGCGCGGG TGACCGGGAT CGTCCCGTCG
CTGCGGGGAG TCCGTGGGGA GGCGGGGCGG TTCTACAACC ACGCGGCGCC GGTCGCGATG
AGCAGCCTCG GGAAGGTGTT CCAGAACCGC GTTGACGTGT TGCTCGTCGG AGCGCTGTTG
ACGGCGACCG CCGCGGGCGT GTACAACGTC GTCCTCGTGT TGATCGCGAT CGCGTGGATC
CCCCTCATCG CGTTCAACCA ACTGCTGCCG CCGGTCGCCT CGGATCTGTA CGCCGACGAT
CGGATCGAAA CGCTCAACGC GGTGTACACG TCGGTGACCC GTCAGATCGT CACGAGCGTG
ATCCCGATCC TCGCCGTGCT CGTGGTGTAC GGCCGGGAGC TACTCGGGCT GTTCGGCGAG
CCGTACGTCG CAGGGTACGC CCCCCTCGTC GTCTACCTCG GCGGGGTGTT CGTCGGCAGC
GCGGTGGGCG CGACCGGCTG GCTCCTGATG ATGACCGACC ACCAGTACGC CCGGATGGCG
CTCGACTGGC TGCTCGCCGT CCTCAACGTC GCCTTAACGT ACGCATTCGT GGTCCGGTAT
GGGCTCGTCG GCGCCGCGCT CGGCACCTCG CTCGCGATCG CGGTGCAAAA CGCGATTCAG
GTCATCCTGT TGCGCCGCTT CGAGGGGCTG TGGCCGTTCG ACCGCACCTA CCTCACCCCG
CTGGTGGCCG GCGGCGTGAC GTTCCTCGCG ATGCGAGCAA TTCGGGAGGT TGCCCCCGGG
CGAGCCGCGG TCGTCGTCGG GGCCGCGGGC GGGCTCGTGG TCTACGCGGG CACGCTACAC
GTCCTCGGCG TTGATCCCCG AGACCGGCTC GTCGCACGAG AGCTTGCGGG GCGGTACCGT
GGGGCCCTCG CCGAGTGGCT CGGTCGGTAA
 
Protein sequence
MSSRYVNDGL GAGGDGEREA ASDDDSVPDA EREALLTIAG GAVITAGGVS GQRALTAVTE 
FALARGLGPA AYGVYALAWR IAQLLSRLVT FGSVPALQRY LPEYADDPDR QGVVAGLAYA
TTLGFGAAIA AGIWVAAPRI NALTVEAPAF PPTMRAFGFL VGSLGVVMVA SAIFRAVGSA
RGEIAFNKLL RPGVRLVGAL TALALGYSVV GVAGGIVVAT ALLAAVAAPL SARVTGIVPS
LRGVRGEAGR FYNHAAPVAM SSLGKVFQNR VDVLLVGALL TATAAGVYNV VLVLIAIAWI
PLIAFNQLLP PVASDLYADD RIETLNAVYT SVTRQIVTSV IPILAVLVVY GRELLGLFGE
PYVAGYAPLV VYLGGVFVGS AVGATGWLLM MTDHQYARMA LDWLLAVLNV ALTYAFVVRY
GLVGAALGTS LAIAVQNAIQ VILLRRFEGL WPFDRTYLTP LVAGGVTFLA MRAIREVAPG
RAAVVVGAAG GLVVYAGTLH VLGVDPRDRL VARELAGRYR GALAEWLGR
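
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that, compute GC content, and run a basic coding-sequence sanity check. The helper names are hypothetical. The start codon set is an assumption (translation table 11 permits more start codons than the three listed). The demo string joins only the first and last lines of the gene sequence, so it is not a contiguous stretch of the gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_cds(
    dna: str,
    starts=("ATG", "GTG", "TTG"),  # common starts; table 11 allows others
    stops=("TAA", "TAG", "TGA"),
) -> bool:
    """True if length is a multiple of 3 and the ends are start/stop codons."""
    return len(dna) % 3 == 0 and dna[:3] in starts and dna[-3:] in stops


# First and last lines of the gene sequence, copied from the record.
# Joined here for illustration only; the middle of the gene is omitted.
first_line = "ATGTCGAGTC GATACGTGAA CGACGGACTC GGTGCCGGTG GCGACGGCGA GCGCGAGGCG"
last_line = "GGGGCCCTCG CCGAGTGGCT CGGTCGGTAA"

excerpt = clean_sequence(first_line + "\n" + last_line)
print(len(excerpt))             # 90
print(looks_like_cds(excerpt))  # True
```

Passing the full gene sequence to `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field in the record.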