Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0946 |
Symbol | |
ID | 8741531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 969325 |
End bp | 973332 |
Gene Length | 4008 bp |
Protein Length | 1335 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 646511524 |
Product | type II secretion system protein E |
Protein accession | YP_003402513 |
Protein GI | 284164234 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATTG ACGAGGCCGA CCAGTCGGAT GCCGGGGACT TTTCTGAGGG GGAGGCATCC GAGTCTGCGT CGGAGGCGAA TCGATCTCGC GGGTCGGACG GCGATGCGAA CTCAGGTTCG GGAGTTTGCG TCGGCGAGTA CACGTGGGAG AATTTCATGG AAGAGTACGG ATACAGCGAC GACGCGTCGG TGCTGTATCC CGACGATCCC GGCGAGACGA CGTCGGACGA CCAACTGGGG TTAGATACCG ACGAAGGGCC CGAACACACG GTCCCGACGG GCGACGACTG GAAGCAGGTC GAGTTCGACC CCGAGTCCTA TCTGGGCTAC CACCCCGACG ATCTGGACTC GACGGTCGTG CCGACCGCCG GCGACAACGC CGAGGAACTC TGGGACGTCT TTCGGGAGTA CGCCGATCCA GAGACGACGC CGGTCGTCAA AGACACCTGG ACCTGGGAGC ACTACAAGTG GGAGTACTAC TACGAGGACG ACGGCTCCCG ACCGCGGGAT AGCGACGGTG AAATCGTTCG CCACGACGAG GAGGAGGCGC TCGGCTTCGA TCCGGAGACC GTAGAACAGC GCCTGTTCGC GGCCGACGAC GCCGCGATGG AACTGGACGA ACTCATCGAG GAGCGAACCG TCAACGTCCA GGAGGAGATC GACGAGGACG AGTTCTTCTC GACGACCGCG GGCAACACGA CCGTCACCAA CCGCTACGAT CTGGAAAAGG CCGTTCCGCT GGACAAGAAG ACCCACTTCC GAGAGGTCGA GCGCTACTGG GTGAACAAAC CCTACGCCTG CGTTGTTATC TTCCACTCCG AGAAGGAAAA CGAGAAGAAG TACTACCTGA TCGAGCCCTA TCTGAACGAG ATCGAACTCG AGTTACAGGA GTTCCTCTCG GGCAAACTGC GAACCGCGAT CAAGTACTCC GACGAGGGGA TCAAGGAGAA GGCGACCGAG GACGGCCGCC GGACGGTCAT CGAGGACGAG ACCCGCCAAC TGCTCAAGCG CTACGATCTG TTCGAGAAGA CAGCCAGCAG TTCCCGGGAG AGCGTCCTCG ACACGCTGCG AAATCTGCTC GACGACGAGG ACGACGGTCC CGACGAGGAC GACGGTCCCG ACCCGCTCGA GGGGCTCTCC GTCCGACCCG AGCCGGCGAT CCTCGAGGCC GATCCGGACA CGCTAAGCGA GTATCAGGTC GAGAAACTGC TCTACCTGCT CAAGCGCAAC TTCATCGGCT ACGAGCGCAT CGACGGGATC AAACACGACA TCAACGTCGA GGACATCTCC GTCGACGGCT ACAACTCGCC GGTGTTCGTC TACCACTCGG AGTACGAGCA GATCATCTCG AACGTCTACC ACGGCGAGGA CGAACTCGAC GACTTCGTCG TCAAACTCGC CCAGCGCTCC GGGAAGGGGA TCAGCAAACG GCTGCCGCAG GTCGACGCGA CCCTCCCCGA CGGCTCGCGC GCCCAGTTGA CCCTGGGGAA GGAGGTGTCC GACCACGGGA CCAACTACAC CATCCGTCAG TTCAAGGACG TCCCCTTTAC CCCGATCGAC CTCATCAACT GGAACACCTT CTCGCTGGAC GAGATGGCGT TCCTCTGGCT CTGTATCGAG AACCACAAGA GCCTGATCTT CGCTGGAGGT ACCGCGTCCG GGAAGACCAC CTCGCTGAAC GCCGTCTCGC TGTTTATCCC CTCGAGCGCG AAGATCGTCT CCATCGAGGA CACCCGGGAG GTCGAACTCC CGCAGCGAAA CTGGATCGCC TCCGTTACTC GGCCGTCGTT CGCCGACGAC GAGCAGGGCG ACGTCGACGA GTTCGACCTG CTCGAGGCCG CGCTCCGGCA GCGACCCGAC TACATCGTGA TGGGTGAGAT CCGTGGTGAG GAGGGGCGGA CGCTGTTCCA GGTCATGTCG ACGGGTCACA CGACCTACAC GACGTTCCAC GCCGACTCCG TCGACGAGGT GTTAAAGCGG TTCACGACGG ACCCGATCAA CGTCTCGAAG ACGATGTTCA CCGCGCTGGA CCTGGTTTCG ATCCAGACCC AGACGCGGGT GCAGGGTCGG AAGGTCCGCC GGAACAAATC TCTCACGGAG ATCAACCACT ACGAGGCCGA ACACGACGAG ATCAACGTCC AGGACGTCTA CCAGTGGCAG GCCGAGACCG ACGAGTTCCT CAAAATGGGG GACTCGAATA CCTTGGAGGA GATCCAGTTC GACCGCGGCT GGAGCCGCGA GAAACTCGAG GAGGAACTGT TCAAACGCGA GGTCATCCTC GCCTACCTCA TCAAGAACGG ACTCAACACG TACGCGGAGG TCGCGGCGAC AGTGCAGGCG TTCATCAACG ACCCCGACAC GATCCTCACG CTCATCGCGA ACGGCCAGCT CGAGGACAGC CTCGAGGACC TCCGGGAGAT GGAGAGCGTC CTGATCGACG TCGATCCGGA GAAAGAGGAA CTCGTCCCGC GACCGGACGC GACCGACGAG ACGTACAACA TCTCGATGGA CCTGCTCGAA CGCGCCGAGG AGTCGCTGTT CGAGGAGTAC CGCGGCAAAA CGCCGAGCGG ATTGGACAGC GCTCTCGGGG GGCCCGAGAC GGAGGAGCCG ATCGAGGTTG ACAGCGCCGA CGCCGACGAG TTCGACTTCG CCGGCGACGT CGACGGCTCG GTCGACGACG ACGAGTGGGA ACTCGGCGAC GGCTCGAGCG GCTTCGGCGC CGGCGAGGAC GCCGGGGAGC CGGCGTGGCT CAGCGACGAC ACCGGCTTCG ATATCGGCGG CGACGAGGGC GCCGCCGACA GCGCCGGAAC GGCTGCGAAC GCCGACGAGG GGGTCGCCGC TGAATCGCCT GATGCCGGCA CCACGGACGA GACGTCCGAC GGATTCGAGA TCGATGACGA GACGGGCGGC GTCAGCGGCC CACCTGCGCC GGCGGCGAGC GCGGACGGCG CGGACGCGGA GGCGACGGCC CCGCAGCCGG CAGCCGGCGA CGAGACGGAG ACGAACACAA CCGTCATGCC GACGGATGAC GCCGACGACG CCGATCTCGG CGGGCTGTTC GACGATATGG GCGACACCAT CGACAGGCTC GACGCGGACG GCCAGCCGGA ACCGCCCGCG GAGACCGACG CCGCGTCGAC GGCCGAGCCC GACACGTCCG GCTTCGATTC GATGTTCCCC GAAGACGACC TCGAGTCGAT CTTCGACCCC GAATCCGACG CGGAGACGGC CGGCGACTTC AGCGATCAGT TCGAGCCGGC GACCGCCGAC GAGCCGGTCG GCGACGAACT CGAGACCGCC GACGAACCCG ACGAGACGCC GACGATCGAT CTCGGAGCGG CGGTCGACGA CGCTGCCGAA GCCGACGAGA CATCGACGGC CGCACCCGAG GAGTCGACCG ACGAAACCGC TGAACCCGAC GAGACGCCGA CGATCGACCT CGGAGCGGCG GTCGACGACT CCGCCGGGGC GGACGACGAC GCCGAAGAGG TTGCCGAATC GGAAGCTGAC GCCGATTCCT CGAGTATCAT CGACGACGGC AGCGACGATG ATGCCGCTGT CGAGAACGCC GATAGTATCG TGGCCGACGA ACCGACGGAC GATGAGGCGG AGTCGGGACC GGACGCCACA CCCTCGTCGA ACGAGCAAAG CGAAGGTGAC GGGCGTCTCG AAGGCGATGG CGACGACGCT CCGGTCGACC AGCCGGCCGA CGAGGAACCG ACGGCGACCG CAGCTGATCC ACCCGGCGAG GATCGCGACG ATCCGACGGA GCCGGCGGAA GGTGACGATG GTACCGGTGA CTCGGAGTCG ATCTTCGGGA CCAAGTCCGA CTCGATCTTC TCCGAGGACT CGGAGGCGGA CGACGCCGAT GACGGGTCGC TGTTCGAGGA CGAACAGGCG CAGGCCGAAA CGGACGATTC GATCTTCAAA CCGGCCGAGA CCGAGGACGA CGAGCGGACG GACGACCAAA TCGATCCCGC CGACGATAAC GAGGATACCA ACGCATGA
|
Protein sequence | MAIDEADQSD AGDFSEGEAS ESASEANRSR GSDGDANSGS GVCVGEYTWE NFMEEYGYSD DASVLYPDDP GETTSDDQLG LDTDEGPEHT VPTGDDWKQV EFDPESYLGY HPDDLDSTVV PTAGDNAEEL WDVFREYADP ETTPVVKDTW TWEHYKWEYY YEDDGSRPRD SDGEIVRHDE EEALGFDPET VEQRLFAADD AAMELDELIE ERTVNVQEEI DEDEFFSTTA GNTTVTNRYD LEKAVPLDKK THFREVERYW VNKPYACVVI FHSEKENEKK YYLIEPYLNE IELELQEFLS GKLRTAIKYS DEGIKEKATE DGRRTVIEDE TRQLLKRYDL FEKTASSSRE SVLDTLRNLL DDEDDGPDED DGPDPLEGLS VRPEPAILEA DPDTLSEYQV EKLLYLLKRN FIGYERIDGI KHDINVEDIS VDGYNSPVFV YHSEYEQIIS NVYHGEDELD DFVVKLAQRS GKGISKRLPQ VDATLPDGSR AQLTLGKEVS DHGTNYTIRQ FKDVPFTPID LINWNTFSLD EMAFLWLCIE NHKSLIFAGG TASGKTTSLN AVSLFIPSSA KIVSIEDTRE VELPQRNWIA SVTRPSFADD EQGDVDEFDL LEAALRQRPD YIVMGEIRGE EGRTLFQVMS TGHTTYTTFH ADSVDEVLKR FTTDPINVSK TMFTALDLVS IQTQTRVQGR KVRRNKSLTE INHYEAEHDE INVQDVYQWQ AETDEFLKMG DSNTLEEIQF DRGWSREKLE EELFKREVIL AYLIKNGLNT YAEVAATVQA FINDPDTILT LIANGQLEDS LEDLREMESV LIDVDPEKEE LVPRPDATDE TYNISMDLLE RAEESLFEEY RGKTPSGLDS ALGGPETEEP IEVDSADADE FDFAGDVDGS VDDDEWELGD GSSGFGAGED AGEPAWLSDD TGFDIGGDEG AADSAGTAAN ADEGVAAESP DAGTTDETSD GFEIDDETGG VSGPPAPAAS ADGADAEATA PQPAAGDETE TNTTVMPTDD ADDADLGGLF DDMGDTIDRL DADGQPEPPA ETDAASTAEP DTSGFDSMFP EDDLESIFDP ESDAETAGDF SDQFEPATAD EPVGDELETA DEPDETPTID LGAAVDDAAE ADETSTAAPE ESTDETAEPD ETPTIDLGAA VDDSAGADDD AEEVAESEAD ADSSSIIDDG SDDDAAVENA DSIVADEPTD DEAESGPDAT PSSNEQSEGD GRLEGDGDDA PVDQPADEEP TATAADPPGE DRDDPTEPAE GDDGTGDSES IFGTKSDSIF SEDSEADDAD DGSLFEDEQA QAETDDSIFK PAETEDDERT DDQIDPADDN EDTNA
|
| |