Gene Hlac_1062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1062 
Symbol 
ID7400134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1058680 
End bp1061826 
Gene Length3147 bp 
Protein Length1048 aa 
Translation table11 
GC content69% 
IMG OID643708130 
ProductOligosaccharyl transferase STT3 subunit 
Protein accessionYP_002565729 
Protein GI222479492 
COG category[R] General function prediction only 
COG ID[COG1287] Uncharacterized membrane protein, required for N-linked glycosylation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACT CGAGCGATCC ACCGGCGGCG TCGGATCGGT CCTCCGCCGA CCTCCTCGCA 
CGGTGGTACC ACGTCCCGGT TCTTCTCGGC GTACTGGCCT TCATGCTCTG GACCCGCCTC
CAGTCGTACG GGAACTTCGT CCAGAACGGC GAGGTGTACT TCCGCGGCAA CGACCCGTGG
TACCACTTCC GCGAAACGAT GTACATCATG GAGAACTACC CGAATCGGAT GCCCTTCGAC
GTCTGGACCG GGTTCCCGCT CGGGAACCAG GCCGGCCAGT TCGGCACACT CTGGGACCAC
ATCATGGCGG TCGGCATCTG GATCGCTCGC CCGATCATGG GCAGCGCCGA GGAGGTCATG
CTCATCATGG CCCCCATCGC GGGGATGCTC GTTGCGATCC CGACGTACTT CATCGCACGT
CGGTTCGTCG ATCGCTTCGC GGCGCTCGCC GCGGTCGTCG TGCTCGCGCT CCTGCCGGGG
ACCTTCTTCA GCTACAGTCT CGTCGGCTTC CCGGACCACA GTGCGGCCGA AGTGCTCTTC
CAGAGTCTCG CCATCCTCGC GTTCCTCGTC GCCGTCGCGG TCGCCGAGCG CGAGGCACCC
GTCTGGGAGC TCGTCGTCGA CCGCGACTGG GCCGCACTCA AGCGTCCCGC CGCGTACGCG
GCCGCCGCGG GCGTCGCGCT CGGTCTCTAC ATGTGGACGT GGCAGCCGGG AGCCCTCATG
GTCGGGTTCA CCGGAGTCTT CCTCGCGATC AAGATCACCA GCGACGTGTC CCACGGGAAG
AGCCCGGAGC CCATCGCCTT CGCGGGCGCG GTGTCGATGG TCGTCGCCGG ACTGATGCAG
ATCGTCCCGC TCGACACGTT CCGGTTCGCT GTGAGCGAGT ACTCGTTCCT CCAGATCGTC
TCCCCGCTCG GCGTCGCGCT CGGCGCGGTC TTCCTCGCGT GGCTCGCCCG CCAGTGGGAG
TCGCGCGACC TCGACGCAGC CACCTACCCG CCCGCGGTCG GCGGCCTCAT TCTCGCGTCC
GCGGGCGTCG TCTGGCTCGC GATCCCCTCG CTGTGGTCGA CGCTCACCGG CAGCCTCCTC
AACACCGTCG GCTTCTCGGC CAGCGCCGGC GCTCGCACCA TCGGTGAGGC CCAGCCGCCC
CTCCAGGACG CCTCCCTCGC CGACTTCGTC CTCGGCCAGT ACGGCCTCGC GTTCTTCCTC
GCGCTCGCCG CTGTTCTCTA CATCCTCGCG CGTCCCCTCT ACCGTTCCGA CGACGCGAAC
CACACGCTCT ACATCCCGGC CGCACTCGCC GTGGTCGGCT CCGTCTACGC GGTCCCCCAG
CTCTACGACG CCATCGGCGG CGTCGTCGGC GTGAGCTGGC AGGTGATCGG ACTCCTCATC
GCGGCCGCCT TGCTCGTCGG CGCGACGTTC CTCGTCGAGT ACGACGCCGA GGAGCTGTAC
TTCGTCGTGT GGGCGGCGTT CATCGGGAGC GCCGCGTTCA CCCAAGTTCG CTTCAACTAC
TACCTCGCGG TGATCGTCGC GGTCGGCGCG GCCTACTTCG TGCAGCTGGC CGTCGACTTC
CTCCGCCTGC GCTCGCTCTC GGGCGTTCGC GACGTCGAGG GGTGGCAGGC GATGGGCGCG
CTCGTCCTCG TCGTCATCGT CCTCGTCCCG CTCGTCGCGA TGGCGACGCC GGTGTGGGCC
GCGGGCGCGA ACACCGGCCC CGGCAGCGTC ACGCAGTGGG ACGGGAGCCT CCAGTGGATG
AACGACGAGA CGCCCACGCC GGGCGAGCTC GAAGGCGCCG ACAACCCCAT GGAGCTGTAC
GGCACCTACG AGCGCCCCGC CGACGGCGAC TTCGAGTACC CGGAGGGCGC GTACGGCGTG
CAGTCGTGGT GGGACTACGG CCACTGGATC ACCACGCGCG CCGAGCGCAT CCCGAACGCG
AACCCGTTCC AGCAGAACGC GGGCGAAGCG GCTGACTACC TCCTGGCACC GAGCGAAGAG
GCGTCCCGCG AGGTGCTCGC GAGCCAGAGC ACGGAGGGCG AGAACACCCG CTACGTGATG
GTGGACTCGC AGATGGCCTC GCCGAACTCC AAGTTCGGCG CGCCCGTCAC GTTCTACTCG
GGCAACGAGA CGCGCGACGA CTTCAACCGC GTCCTCTATC AGCAGACCGA GCAGGGCGGG
TTCCAGACGG TGATGCAGGT GAACACGCAG CGCTACCACG AGAGCCAGAT GATTCGGCTG
TACGAGCACT ACGGCAGCGC GGTCGACCCC GCACCGGTCG TCGTCGACTG GGAGACGCAG
AGCGCCCAGA CCGGGAGCGG CGAGCAGATC GAGATTAACA CCCTCCCGAG CGATCCGGGA
GCGACGATCC GACAGTTCGA CAACGTCTCG GCCGCCCGCG CGTACGTCGA GGAAGACGGC
AGCGCGCAGC TCGGCGGCGT CGGCGACCTC CCCTCGGAGC GCGTCGAAGC CCTCGAACAC
CACCGGCTCG TCCACACATC GCAGGCGGCA GGGCAGTCGC CCTCGGCGAG ACAGGTGCAG
ATCCTGCAGC AGCTCGGCGT CGACGTGCAG AGCGTCCTCG GCGAGTCGAC CCTGCAGGCG
TTCCAAGACG ACTTCGTGAA GACGTTCGAA CGCGTTCCCG GCGCGACGAT CGAAGGCTCG
GGCGCGGCGC CCGGCCAAGA GGTCGAAGCG ACCGTGGAGC TGGAGAAGTC CACGGGACAG
ACCTTCGAGT ACACCCAGTA CGCCGAGGCC GACGAGAACG GGAACTTCGA GCTCACGGTC
CCGTACTCGA CGACGGGCTA CGACGAGTTC GGTCCCGAGA ACGGCTACAC GAACACGAGC
GTCCGCGCCA CTGGCCCCTA CAACGTCACC ACAGAGGCGA CCACCGACGA CGACCTCACC
ACGACACAGC GCGTCGGGCA GGTCGAGGTG ACCGAGGGGC AGGTCGTCGG CGCGGACGAC
GCGGCGGCGA CGGTCGACCT CACCGAGGAG GTCATCGACT GCCCGAGCGG CGACCCGGAC
TGTTCGGTCG AGCAGAGCGA CAAGGGCAGT GACGGGTCCG ACGGAAGCGA CGGCAGCGAC
GACGGCAACA CCACCAACGC GGTCGACGGC GCGATGACAC TGGTCGCGAC CGAGTCGGCC
GCGACGATGA CGCCCGTCGC CGCCTGA
 
Protein sequence
MSDSSDPPAA SDRSSADLLA RWYHVPVLLG VLAFMLWTRL QSYGNFVQNG EVYFRGNDPW 
YHFRETMYIM ENYPNRMPFD VWTGFPLGNQ AGQFGTLWDH IMAVGIWIAR PIMGSAEEVM
LIMAPIAGML VAIPTYFIAR RFVDRFAALA AVVVLALLPG TFFSYSLVGF PDHSAAEVLF
QSLAILAFLV AVAVAEREAP VWELVVDRDW AALKRPAAYA AAAGVALGLY MWTWQPGALM
VGFTGVFLAI KITSDVSHGK SPEPIAFAGA VSMVVAGLMQ IVPLDTFRFA VSEYSFLQIV
SPLGVALGAV FLAWLARQWE SRDLDAATYP PAVGGLILAS AGVVWLAIPS LWSTLTGSLL
NTVGFSASAG ARTIGEAQPP LQDASLADFV LGQYGLAFFL ALAAVLYILA RPLYRSDDAN
HTLYIPAALA VVGSVYAVPQ LYDAIGGVVG VSWQVIGLLI AAALLVGATF LVEYDAEELY
FVVWAAFIGS AAFTQVRFNY YLAVIVAVGA AYFVQLAVDF LRLRSLSGVR DVEGWQAMGA
LVLVVIVLVP LVAMATPVWA AGANTGPGSV TQWDGSLQWM NDETPTPGEL EGADNPMELY
GTYERPADGD FEYPEGAYGV QSWWDYGHWI TTRAERIPNA NPFQQNAGEA ADYLLAPSEE
ASREVLASQS TEGENTRYVM VDSQMASPNS KFGAPVTFYS GNETRDDFNR VLYQQTEQGG
FQTVMQVNTQ RYHESQMIRL YEHYGSAVDP APVVVDWETQ SAQTGSGEQI EINTLPSDPG
ATIRQFDNVS AARAYVEEDG SAQLGGVGDL PSERVEALEH HRLVHTSQAA GQSPSARQVQ
ILQQLGVDVQ SVLGESTLQA FQDDFVKTFE RVPGATIEGS GAAPGQEVEA TVELEKSTGQ
TFEYTQYAEA DENGNFELTV PYSTTGYDEF GPENGYTNTS VRATGPYNVT TEATTDDDLT
TTQRVGQVEV TEGQVVGADD AAATVDLTEE VIDCPSGDPD CSVEQSDKGS DGSDGSDGSD
DGNTTNAVDG AMTLVATESA ATMTPVAA