Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0978 |
Symbol | |
ID | 7401872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 967934 |
End bp | 971701 |
Gene Length | 3768 bp |
Protein Length | 1255 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643708044 |
Product | type II secretion system protein E |
Protein accession | YP_002565646 |
Protein GI | 222479409 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.742375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACCG AGGACGCCGA GCGCTCGCCC GTCTCGTCCG ACGAGACGGC GGGATCGCCG GCGTCCGATC ACCGGGCGTC CGGTCCGCCG CCGTCCGATC CCTCGGTCTC GGATCGCCCG GTATCAGTGG GACGGTACTC GTGGCGGTCG TTCCTCCGGG ACCGCGGCCG CGACGACGCC GCGACCGAGC TGTACGCCGA CATCAGCGAG GAGCCGGTCG TCCCCGCGAC CGCCGTCGAC GCTCACTTCG AAGACAGCGT CGATCGGGTG ATCCGCGCCG CCGGCGTCGA CGAGACGTAT ACGGCCGACG GCGACACCGC AGTCTCGGGA CACGGGACGC CCGAAGCAGG CACCGTCGTG ATCGGGACCG ACCACGTCGT CCTCAGCGGT GCGGCCGTCG TTCCGGCTGG GACCTCGACC GCGGCAGTGG CTCCCGACCC TGTAGAATCC GTCGACGAGA CCGTAGAAGG GGACGAAGAT GACGTCGAGA CCGATCGCGG CGAAGAAGAC GGCGAAGCAG ACGACGGCGA AGGAGACGAT GAGGACGGAA GCGATGAGGA CGGAAGCGAT GAGACCGCCG AAGCGGCCGA CGAATCGGCC GAAAGCGACC GTGCGGACTC GGACGACCCG GACGCTCGCC CCGGCATCTC GATCGACGGC GCGGGCGTCT CCGGGGTCGT GGTCGTCCCG GCCGAGGACG TGGAGCCGGT TCCCCGCCGC ACACCGACTG ACGAGGAGTG GTCGCGGGTT GACATCGATC CGAGCGAGTT CCTCGGGTTC GATCCGTCGG AGACGGGCTA CCGCGTCGGG GCGGCTGCGG CGGTTGGAGA CGTTCTCTGG GACCTGTGTT CGGCTCGATA CAACCTCTAC GAAGTGCCGG TGCTCAAGGG GTACTACACG TGGGACGACT ACCGCGACGA GTACTTCCTC GACGAGGAGG GGAACCCGCC GACCGAGGAG AACGAGGAAG GGGAAGAAGA GCCGCTGGAG TTCACCCACG ACGACAAGGT CGAGGCGCTG GGGTTCGACC CCGACCGGAC CGAGGAACTA CTGGGGGCGG GCGGGGGCGC CGCGGCCGAC CTCGCGGAAC TGGTCGACGA GCGCACGGTC GACGTGAATC CGGAGATCGA CGAGGACGCG TTCTTCTCGA CGGAGGAGGG GCACACCACC CTCGCGAACC GATACGACCT GGAGAAGGCG GTGCCGATGC CGAAAAAAAC TCACTTCCGG GAGATCGAGC GGTACTGGGT GAACAAACCG TACGCGTTCG TGATCGTGTT CCGGTCGACG AAGGAGAACG AGGTGAAGTA CTACGCGATT CAGCCGCACC GGACGGAGAT CGAGACCGAC CTCGTCGAGT TCCTCACTGG GAAGCTCCGG ACCTCGATCA AGTACGCCGA CGAGTCGATC GCGGGCGGTG ACGAGGAGTT CCGCGAGGGC GTGATCGTCG ACGAGACGCT GACGCTGCTC GACCGATACG ACCTCTACGA GCGCACGGAC GACGACCGTG GGGTCGTCGA CGACCTCGTG GACGACCTCG TCGACCGCTT CGGTTTCGAC CTCACGGAGG GTATCGCCGG CCGGATCAGC GAATCGCTCG GGTACGAGCC GCCGACGGAG CCGGAGTCAG AATCGGCACC GGCGAAGATA CTCGCTCGGC CCGAGCCCGC GGTGCTCGCA GAGGACTCGG AGACGCTCTC GAAGCATCAG GTCGAGAAAC TGCTGTACTT CCTCAAGCGC GACTTCATCG GCTACGAGCG GATCGACCCG ATCAAGTACG ACATCAACGT CGAGGATATC TCCTGTGACG GGTACAACTC CCCGGTCTTC GTCTACCACT CCGACTACGA GCAGATCATC ACCAACGTCT ACCACGGGAC CGACGAGCTC GACGACTTCG TCGTGAAGCT CGCGCAGCGC TCCGGAAAGG GGATCTCGAA GCGGCGCCCG CAGGTGGACG CCACCCTCCC GGACGGCTCC CGTGCCCAGC TTACGCTCGG TCGCGAGGTC TCGGACCACG GGACCAACTA CACCATCCGG CAGTTCAACG AAATTCCCTT CACGCCGATC GACCTGATCA ACTGGAAGAC GTTCTCGCTC GACGAGATGG CCTTCCTGTG GCTCTCCATC GAGAACCACA AGAGCCTGAT CTTCGCGGGC GGCACCGCCT CCGGGAAGAC GACGAGCCTG AACGCGGTCT CGCTGTTCAT CCCTTCGAAC GCCAAGATCG TCTCGATCGA GGACACCCGC GAGGTCGAGC TTCCCCAGCG CAACTGGATC GCCTCCGTCA CTCGCCCCTC CTTCTCCGAC GACGACAAGG GCGACATCGA CGAGTTCGAC CTGCTGGAGG CCGCGCTCCG TCAGCGTCCC GACTACATCG TGATGGGCGA GATCCGCGGC GAGGAGGGCC GCACGGCCTT CCAGGTGATG TCGACCGGCC ACACCACCTA CACGACGTTC CACGCCGACA CCGTCGGCGA GGTGCTCAAG CGATTCACCA CGGAGCCGAT CAACGTCTCG AAAACGATGT TCACTGCCCT CGATCTGGTC TCCGTCCAGA CGTCGACCCG GGTACAGGGA AAGAAGGTAC GCCGGAACAA GTCGCTGACC GAGATCAATC ACTACGACGC CGAGAACGAC GAGATCAACG TCCAAGACGT GTTCCAGTGG CAGGCCGAGA CCGACGAGTT CCTCCAGATG GGCGACTCGA ACACCTTGGA AGACATCATG TTCGATCGCG GGTGGAGCCG CAAGACGCTC GACGAGGAGC TCCGGAAGCG CCGCGTCGTG TTAGCGTACC TCATCGATCG CGGGCTCAAC AGCTACGCGC AGGTGGCCGC GACGTTCCAG GCGTTCATCA ACGACCCGGA GACGGTGCTC GCGCTGATGG CGAACGAGGA ACTGGAGCGG TCACTGGAGG ACCTCCGCGA GATGGAGTCG GTGCTGATCA ACGTCGACCG CGACAAAGAG GAGATGGTCC CGCGACCCGA TCCCGACGAG GCGGGCCGCG AGGAGGTCGA GCGGATCTTA GCGGAGGCCG AGGACTTGTT CGCGGAGTAC CGCGGCCGGA TGCCCGACTC CGTCGCTGAC GCCCTCCTCG ACGTCGCGCC GGCCCGTAAC GTGGAGGCGC AACCCGTCGC CGACCGGGAG GCGCTCGCGC AGGCGGCCGA CGAGGCCGCG GCGCTCGACG GGGAGTCGAG CGCGACGAAC GAAACCGACG AAACTGACGA GACTGAGGAG GGGCGCATCG TCGCCGACGC AACGCCGCCG ATCGAGGGAT TCGAAGCCGG AGTGACGGCG GAAGGGAGCT TCGCCGACGG GAGCGACGCC CGGGACGGTG ATCGCACCGG CGACACCAGC GACGGCGAAA ACGCGGACGA CACGGACGAC CCAGACGGTG GGATCGACTT CGACGAGCCG TTTGACGAGG GGATCGACGT ACTCGACTCG AGACCGGATT CCGGCCCGGT TCCGGGACCG AACCCGAATC CGGCGCCAGA TCCGAACCCG GAACAGGCCG ATTCCGGTGT CAGAGCGGAC GCCGACGGCG GGGTGACCGA AGTCGAGGTC GGGGTCGAAG ATGCCAAAAT CGACGACGAG GATGCGGGCG ACGCCGCCAC CCGGAAAGAG AGTGGAGACA CCGAGAGCGA CGATGAGGGT GAAGGCGACA CCGCCACGGA GGGCGAGGAT GAGGATGACG ACGGGAACAA CGCCGACAAC ATCGACGACT GGGGGTTCGG CTCCGTCGAG TCCCGGGAGG AGCGGTAG
|
Protein sequence | MTTEDAERSP VSSDETAGSP ASDHRASGPP PSDPSVSDRP VSVGRYSWRS FLRDRGRDDA ATELYADISE EPVVPATAVD AHFEDSVDRV IRAAGVDETY TADGDTAVSG HGTPEAGTVV IGTDHVVLSG AAVVPAGTST AAVAPDPVES VDETVEGDED DVETDRGEED GEADDGEGDD EDGSDEDGSD ETAEAADESA ESDRADSDDP DARPGISIDG AGVSGVVVVP AEDVEPVPRR TPTDEEWSRV DIDPSEFLGF DPSETGYRVG AAAAVGDVLW DLCSARYNLY EVPVLKGYYT WDDYRDEYFL DEEGNPPTEE NEEGEEEPLE FTHDDKVEAL GFDPDRTEEL LGAGGGAAAD LAELVDERTV DVNPEIDEDA FFSTEEGHTT LANRYDLEKA VPMPKKTHFR EIERYWVNKP YAFVIVFRST KENEVKYYAI QPHRTEIETD LVEFLTGKLR TSIKYADESI AGGDEEFREG VIVDETLTLL DRYDLYERTD DDRGVVDDLV DDLVDRFGFD LTEGIAGRIS ESLGYEPPTE PESESAPAKI LARPEPAVLA EDSETLSKHQ VEKLLYFLKR DFIGYERIDP IKYDINVEDI SCDGYNSPVF VYHSDYEQII TNVYHGTDEL DDFVVKLAQR SGKGISKRRP QVDATLPDGS RAQLTLGREV SDHGTNYTIR QFNEIPFTPI DLINWKTFSL DEMAFLWLSI ENHKSLIFAG GTASGKTTSL NAVSLFIPSN AKIVSIEDTR EVELPQRNWI ASVTRPSFSD DDKGDIDEFD LLEAALRQRP DYIVMGEIRG EEGRTAFQVM STGHTTYTTF HADTVGEVLK RFTTEPINVS KTMFTALDLV SVQTSTRVQG KKVRRNKSLT EINHYDAEND EINVQDVFQW QAETDEFLQM GDSNTLEDIM FDRGWSRKTL DEELRKRRVV LAYLIDRGLN SYAQVAATFQ AFINDPETVL ALMANEELER SLEDLREMES VLINVDRDKE EMVPRPDPDE AGREEVERIL AEAEDLFAEY RGRMPDSVAD ALLDVAPARN VEAQPVADRE ALAQAADEAA ALDGESSATN ETDETDETEE GRIVADATPP IEGFEAGVTA EGSFADGSDA RDGDRTGDTS DGENADDTDD PDGGIDFDEP FDEGIDVLDS RPDSGPVPGP NPNPAPDPNP EQADSGVRAD ADGGVTEVEV GVEDAKIDDE DAGDAATRKE SGDTESDDEG EGDTATEGED EDDDGNNADN IDDWGFGSVE SREER
|
| |