Gene Hlac_0978 details


Gene Information

Locus tag: Hlac_0978
Symbol:
ID: 7401872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 967934
End bp: 971701
Gene Length: 3768 bp
Protein Length: 1255 aa
Translation table: 11
GC content: 66%
IMG OID: 643708044
Product: type II secretion system protein E
Protein accession: YP_002565646
Protein GI: 222479409
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis
TIGRFAM ID:
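
The length fields in this record are consistent with the listed coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Number of amino acids, assuming one terminal stop codon is not counted."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(967934, 971701)
print(length_bp)                  # 3768
print(protein_length(length_bp))  # 1255
```

Both results match the Gene Length and Protein Length fields reported for Hlac_0978.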


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.742375
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACCG AGGACGCCGA GCGCTCGCCC GTCTCGTCCG ACGAGACGGC GGGATCGCCG 
GCGTCCGATC ACCGGGCGTC CGGTCCGCCG CCGTCCGATC CCTCGGTCTC GGATCGCCCG
GTATCAGTGG GACGGTACTC GTGGCGGTCG TTCCTCCGGG ACCGCGGCCG CGACGACGCC
GCGACCGAGC TGTACGCCGA CATCAGCGAG GAGCCGGTCG TCCCCGCGAC CGCCGTCGAC
GCTCACTTCG AAGACAGCGT CGATCGGGTG ATCCGCGCCG CCGGCGTCGA CGAGACGTAT
ACGGCCGACG GCGACACCGC AGTCTCGGGA CACGGGACGC CCGAAGCAGG CACCGTCGTG
ATCGGGACCG ACCACGTCGT CCTCAGCGGT GCGGCCGTCG TTCCGGCTGG GACCTCGACC
GCGGCAGTGG CTCCCGACCC TGTAGAATCC GTCGACGAGA CCGTAGAAGG GGACGAAGAT
GACGTCGAGA CCGATCGCGG CGAAGAAGAC GGCGAAGCAG ACGACGGCGA AGGAGACGAT
GAGGACGGAA GCGATGAGGA CGGAAGCGAT GAGACCGCCG AAGCGGCCGA CGAATCGGCC
GAAAGCGACC GTGCGGACTC GGACGACCCG GACGCTCGCC CCGGCATCTC GATCGACGGC
GCGGGCGTCT CCGGGGTCGT GGTCGTCCCG GCCGAGGACG TGGAGCCGGT TCCCCGCCGC
ACACCGACTG ACGAGGAGTG GTCGCGGGTT GACATCGATC CGAGCGAGTT CCTCGGGTTC
GATCCGTCGG AGACGGGCTA CCGCGTCGGG GCGGCTGCGG CGGTTGGAGA CGTTCTCTGG
GACCTGTGTT CGGCTCGATA CAACCTCTAC GAAGTGCCGG TGCTCAAGGG GTACTACACG
TGGGACGACT ACCGCGACGA GTACTTCCTC GACGAGGAGG GGAACCCGCC GACCGAGGAG
AACGAGGAAG GGGAAGAAGA GCCGCTGGAG TTCACCCACG ACGACAAGGT CGAGGCGCTG
GGGTTCGACC CCGACCGGAC CGAGGAACTA CTGGGGGCGG GCGGGGGCGC CGCGGCCGAC
CTCGCGGAAC TGGTCGACGA GCGCACGGTC GACGTGAATC CGGAGATCGA CGAGGACGCG
TTCTTCTCGA CGGAGGAGGG GCACACCACC CTCGCGAACC GATACGACCT GGAGAAGGCG
GTGCCGATGC CGAAAAAAAC TCACTTCCGG GAGATCGAGC GGTACTGGGT GAACAAACCG
TACGCGTTCG TGATCGTGTT CCGGTCGACG AAGGAGAACG AGGTGAAGTA CTACGCGATT
CAGCCGCACC GGACGGAGAT CGAGACCGAC CTCGTCGAGT TCCTCACTGG GAAGCTCCGG
ACCTCGATCA AGTACGCCGA CGAGTCGATC GCGGGCGGTG ACGAGGAGTT CCGCGAGGGC
GTGATCGTCG ACGAGACGCT GACGCTGCTC GACCGATACG ACCTCTACGA GCGCACGGAC
GACGACCGTG GGGTCGTCGA CGACCTCGTG GACGACCTCG TCGACCGCTT CGGTTTCGAC
CTCACGGAGG GTATCGCCGG CCGGATCAGC GAATCGCTCG GGTACGAGCC GCCGACGGAG
CCGGAGTCAG AATCGGCACC GGCGAAGATA CTCGCTCGGC CCGAGCCCGC GGTGCTCGCA
GAGGACTCGG AGACGCTCTC GAAGCATCAG GTCGAGAAAC TGCTGTACTT CCTCAAGCGC
GACTTCATCG GCTACGAGCG GATCGACCCG ATCAAGTACG ACATCAACGT CGAGGATATC
TCCTGTGACG GGTACAACTC CCCGGTCTTC GTCTACCACT CCGACTACGA GCAGATCATC
ACCAACGTCT ACCACGGGAC CGACGAGCTC GACGACTTCG TCGTGAAGCT CGCGCAGCGC
TCCGGAAAGG GGATCTCGAA GCGGCGCCCG CAGGTGGACG CCACCCTCCC GGACGGCTCC
CGTGCCCAGC TTACGCTCGG TCGCGAGGTC TCGGACCACG GGACCAACTA CACCATCCGG
CAGTTCAACG AAATTCCCTT CACGCCGATC GACCTGATCA ACTGGAAGAC GTTCTCGCTC
GACGAGATGG CCTTCCTGTG GCTCTCCATC GAGAACCACA AGAGCCTGAT CTTCGCGGGC
GGCACCGCCT CCGGGAAGAC GACGAGCCTG AACGCGGTCT CGCTGTTCAT CCCTTCGAAC
GCCAAGATCG TCTCGATCGA GGACACCCGC GAGGTCGAGC TTCCCCAGCG CAACTGGATC
GCCTCCGTCA CTCGCCCCTC CTTCTCCGAC GACGACAAGG GCGACATCGA CGAGTTCGAC
CTGCTGGAGG CCGCGCTCCG TCAGCGTCCC GACTACATCG TGATGGGCGA GATCCGCGGC
GAGGAGGGCC GCACGGCCTT CCAGGTGATG TCGACCGGCC ACACCACCTA CACGACGTTC
CACGCCGACA CCGTCGGCGA GGTGCTCAAG CGATTCACCA CGGAGCCGAT CAACGTCTCG
AAAACGATGT TCACTGCCCT CGATCTGGTC TCCGTCCAGA CGTCGACCCG GGTACAGGGA
AAGAAGGTAC GCCGGAACAA GTCGCTGACC GAGATCAATC ACTACGACGC CGAGAACGAC
GAGATCAACG TCCAAGACGT GTTCCAGTGG CAGGCCGAGA CCGACGAGTT CCTCCAGATG
GGCGACTCGA ACACCTTGGA AGACATCATG TTCGATCGCG GGTGGAGCCG CAAGACGCTC
GACGAGGAGC TCCGGAAGCG CCGCGTCGTG TTAGCGTACC TCATCGATCG CGGGCTCAAC
AGCTACGCGC AGGTGGCCGC GACGTTCCAG GCGTTCATCA ACGACCCGGA GACGGTGCTC
GCGCTGATGG CGAACGAGGA ACTGGAGCGG TCACTGGAGG ACCTCCGCGA GATGGAGTCG
GTGCTGATCA ACGTCGACCG CGACAAAGAG GAGATGGTCC CGCGACCCGA TCCCGACGAG
GCGGGCCGCG AGGAGGTCGA GCGGATCTTA GCGGAGGCCG AGGACTTGTT CGCGGAGTAC
CGCGGCCGGA TGCCCGACTC CGTCGCTGAC GCCCTCCTCG ACGTCGCGCC GGCCCGTAAC
GTGGAGGCGC AACCCGTCGC CGACCGGGAG GCGCTCGCGC AGGCGGCCGA CGAGGCCGCG
GCGCTCGACG GGGAGTCGAG CGCGACGAAC GAAACCGACG AAACTGACGA GACTGAGGAG
GGGCGCATCG TCGCCGACGC AACGCCGCCG ATCGAGGGAT TCGAAGCCGG AGTGACGGCG
GAAGGGAGCT TCGCCGACGG GAGCGACGCC CGGGACGGTG ATCGCACCGG CGACACCAGC
GACGGCGAAA ACGCGGACGA CACGGACGAC CCAGACGGTG GGATCGACTT CGACGAGCCG
TTTGACGAGG GGATCGACGT ACTCGACTCG AGACCGGATT CCGGCCCGGT TCCGGGACCG
AACCCGAATC CGGCGCCAGA TCCGAACCCG GAACAGGCCG ATTCCGGTGT CAGAGCGGAC
GCCGACGGCG GGGTGACCGA AGTCGAGGTC GGGGTCGAAG ATGCCAAAAT CGACGACGAG
GATGCGGGCG ACGCCGCCAC CCGGAAAGAG AGTGGAGACA CCGAGAGCGA CGATGAGGGT
GAAGGCGACA CCGCCACGGA GGGCGAGGAT GAGGATGACG ACGGGAACAA CGCCGACAAC
ATCGACGACT GGGGGTTCGG CTCCGTCGAG TCCCGGGAGG AGCGGTAG
 
Protein sequence
MTTEDAERSP VSSDETAGSP ASDHRASGPP PSDPSVSDRP VSVGRYSWRS FLRDRGRDDA 
ATELYADISE EPVVPATAVD AHFEDSVDRV IRAAGVDETY TADGDTAVSG HGTPEAGTVV
IGTDHVVLSG AAVVPAGTST AAVAPDPVES VDETVEGDED DVETDRGEED GEADDGEGDD
EDGSDEDGSD ETAEAADESA ESDRADSDDP DARPGISIDG AGVSGVVVVP AEDVEPVPRR
TPTDEEWSRV DIDPSEFLGF DPSETGYRVG AAAAVGDVLW DLCSARYNLY EVPVLKGYYT
WDDYRDEYFL DEEGNPPTEE NEEGEEEPLE FTHDDKVEAL GFDPDRTEEL LGAGGGAAAD
LAELVDERTV DVNPEIDEDA FFSTEEGHTT LANRYDLEKA VPMPKKTHFR EIERYWVNKP
YAFVIVFRST KENEVKYYAI QPHRTEIETD LVEFLTGKLR TSIKYADESI AGGDEEFREG
VIVDETLTLL DRYDLYERTD DDRGVVDDLV DDLVDRFGFD LTEGIAGRIS ESLGYEPPTE
PESESAPAKI LARPEPAVLA EDSETLSKHQ VEKLLYFLKR DFIGYERIDP IKYDINVEDI
SCDGYNSPVF VYHSDYEQII TNVYHGTDEL DDFVVKLAQR SGKGISKRRP QVDATLPDGS
RAQLTLGREV SDHGTNYTIR QFNEIPFTPI DLINWKTFSL DEMAFLWLSI ENHKSLIFAG
GTASGKTTSL NAVSLFIPSN AKIVSIEDTR EVELPQRNWI ASVTRPSFSD DDKGDIDEFD
LLEAALRQRP DYIVMGEIRG EEGRTAFQVM STGHTTYTTF HADTVGEVLK RFTTEPINVS
KTMFTALDLV SVQTSTRVQG KKVRRNKSLT EINHYDAEND EINVQDVFQW QAETDEFLQM
GDSNTLEDIM FDRGWSRKTL DEELRKRRVV LAYLIDRGLN SYAQVAATFQ AFINDPETVL
ALMANEELER SLEDLREMES VLINVDRDKE EMVPRPDPDE AGREEVERIL AEAEDLFAEY
RGRMPDSVAD ALLDVAPARN VEAQPVADRE ALAQAADEAA ALDGESSATN ETDETDETEE
GRIVADATPP IEGFEAGVTA EGSFADGSDA RDGDRTGDTS DGENADDTDD PDGGIDFDEP
FDEGIDVLDS RPDSGPVPGP NPNPAPDPNP EQADSGVRAD ADGGVTEVEV GVEDAKIDDE
DAGDAATRKE SGDTESDDEG EGDTATEGED EDDDGNNADN IDDWGFGSVE SREER
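
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The sketch below shows one way to clean such a block, compute its GC fraction, and translate it. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons, which this sketch does not model). All function names are hypothetical.

```python
# Codon table built from the NCBI-style base ordering T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence listed above.
first_groups = clean_sequence("ATGACAACCG AGGACGCCGA GCGCTCGCCC")
print(translate(first_groups))  # MTTEDAERSP
print(gc_fraction(first_groups))
```

The translated prefix agrees with the first ten residues of the protein sequence in this record. Passing the full gene sequence to the same functions would allow the GC content and protein fields to be checked in the same way.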