Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0563 |
Symbol | |
ID | 8418375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 677567 |
End bp | 682687 |
Gene Length | 5121 bp |
Protein Length | 1706 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 645037131 |
Product | type 1 secretion C-terminal target domain protein |
Protein accession | YP_003197438 |
Protein GI | 258404696 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01965] VCBS repeat [TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAA ACACAATGAA CGTAGCGCAA ATGGAAGGGC AGATCGAGGA AGGGGGGCGC GAGACATCGA ACGCCGCGGC CGAGAATCTT CAGACCTCGC AACAAACGCA ACAACCGATC CCCGTCCCTG AAGCGGGAGA AACCCGTGTT GTTCAGGTCG CCCCAGGTGA TGTTATCCAA CTCGGTGCTG ATGTCACCGA AGCTGAACTC ATTCAGCAGG GGGGAAGTGT CCTCCTCCGT TTTTCAGATG GAGGAGACCT GCTTTTGGAA GATTTTATCA ATCAGACAGA CGCGGGCGAA CCACCCACGT TGATTCTTGC TGACGGCTCT GAGGTAACCG CCGACCAGAT CATTGCTGCC ATGACACCAG CACCTGATGC GGCTCCTGCT GCCGGAGAAG GGCCGACCAG CGGTGGAGCG GGCGAATATA GAGAAGATGT TGGCAATCTC ATAGAAGGCG TAAACCGGTT GGACGGCCTC GGACCGGATG CCTTGGCAGC CGATGCCGCT TTGGTCCCAG AGGCAGAAGG AGAGGGGATT CTCCCCGAAG TGGAGGAAGA CGCCCTTCCT CTGGCCGCTG ATGACGCCTT TACTATTGAA GAGGACCCGG AATCTCCTTT GGAAGGCGAT CTGAGCCTCA ACGACGATCC TGGTGATGCC CCGGCGACAT TTGCCATTTT GGACGGCCCC GAAAACGGCA CGGCGACCGT CAATCCGGAC GGGACCTTCT CCTACACTCC GAGCGACAAC TATAACGGTC CGGATTCGTT CACCTATACT ATTACTGACA GCGATGGTGA TACGGATACA GCCACCGTGA CCATCGAGGT TACACCGGTC AACGACGCTC CCCAGGCCCT GGACAACGAG TACAGCACCG ACGAAGACTC CAGCGTGGGT GGCAATGTCA TCACCGACAC CGGATTTGAT GATGAGCTCA ATGTCACCGG CGTCGATTCC GATCCGGAAA ACGACCCCCT CACCGTGACC GCGGTTAACG GCACCGCCAT TGAGTCTGGC GACACCATCA CGCTCGCCAG CGGCGCCCTG TTGACCATGA ACAGCGACGG GACTTTCACC TACAACCCCA ACGGGCAATT TGAGGGCCTT GGCGGCGAAG GCAGCGAGAA TTCCGCCGGT TCAGACACCT TCACCTATAC CATTACCGAC GGCGACGCGA CTTCGACCGC TGACGTGACC ATCAATGTCA GCGGCCTCAA CGACGCTCCC CAGGCCCTGG ATAACCAGTA CAGCACCAAC GAAGACTCCA GCGTGGGTGG CAACGTCATC ACCGACACCG GATTTGATGA TGAGCTCAAC ACCACCGGGG TCGACTCCGA TCCGGAAAAC GATCCCCTCA CTGTGACCGC GGTCAATGGC ACCGCCATCA ACTCCGGCGA CACCATTACA CTCGCCAGCG GCGCCCTGTT GACCATGAAC AGTGACGGGA CCTTCACCTA CGATCCCAAC GGGCAATTTG AGGGCCTTGG CGGCGAAGGC AGCGAGAATT CCTCCGGTTC AGACACCTTC ACCTACACCA TCACTGACGG CGACGCGACC TCGACCGCTG ACGTGACCAT CAATGTCAGC GGCGTCAACG ACGCTCCCCA GGCCCTGGAC AACGAGTACA ACACCAACGA AGACTCCAGC GTAGGTGGCA ATGTCATCAC CGACTCCGGA TTTGATGATG AGCTCAATGT CACCGGGGTC GACTCCGATC CGGAAAACGA TCCCCTCACT GTGACCGCGG TCAACGGCAC CGCCATTGAG TCTGGCGACA CCATCACACT CGCCAGCGGC GCCCTGTTGA CTATGAACAG TGACGGGACC TTCACCTACG ATCCCAACGG GCAATTTGAG GGCCTTGGCG GCGAAGGCAG CGAGAATTCC TCCGGTTCAG ACACCTTCAC CTACACCATC ACTGACGGCG ACGCGACCTC GACCGCTGAC GTGACCATCA ATGTCAGCGG CGTCAACGAC GGTCCCCAGG CCCTGGATAA CCAGTACAGC ACCGACGAAG ACTCCAGCGT GGACGGCAAT GTCATCACCG ACATCGGAGT TGATGATGAG CTCAACACCA CCGGGGTCGA TTCTGATCCG GAAAACGACC CCCTCACTGT GACCGCGGTC AACGGCACCG CCATTGAGTC TGGCGACACC ATTACACTCG CCAGCGGCGC CCTGTTGACC ATGAACAGTG ACGGGACCTT CACCTACAAT CCCAACGGAC AGTTCGAAGG CCTTGGCGGT GAAGGCAGCG AGAATTCCTC CGGTTCAGAC ACCTTCACCT ACACCATCAC TGACGGCGAC GCGACCTCGA CCGCTGACGT GACCATCAAT GTCAGCGGCG TCAACGACGC TCCCCAGGCC CTGGACAACG AGTACAGCAC CGACGAAGAC TCCAGCGTAG GTGGCAATGT CATCACCGAC ACCGGATTTG ATGATGAGCT CAATGTCACC GGCGTCGATT CCGATCCGGA AAACGACCCC CTCACCGTGA CCGCGGTCAA CGGCACCGCC ATTGAGTCTG GCGACACCAT CACGCTCGCC AGCGGCGCCC TGTTGACCAT GAACAGCGAC GGGACCTTCA CCTACGATCC CAACGGACAG TTCGAAGGCC TTGGCGGCGA AGGCAGCGAG AATTCCGCCG GTTCAGACAC CTTCACCTAT ACCATTACCG ACGGCAACGC GACCTCGACC GCTGACGTCA CCATCAATGT CAGCGGCGTC AATGATCCGC CTATTGCAGT AGACGATGTA CCTTGTCCTG AGTCGGAAGA CTTCACCGCC AGCTTGTTTT TAAACGAAGG TCAAAACGGC TCCAGTGCTT CAAATTGGTC AATGACAAAT CTGTCTATTA TTGCCTACGA ATTCAACGGA GATAACGAGT CGGACACTTT AGCATACACC CAACAAGGAG TTGGTGTCCT GAGTGATGGG GAAAACAACC CTCCTAATAG ATTCAATGAA CTGGATTATA ACAGAGGTGA AGACAAATCG GAATCGCTAC AAATAAAATT CGATGGCTTT GTCAACACCT CGACTGTTAC ATTAGGCATG TTTTTTGAAA ATGAAGGCCC CAATAATGGT GAAGAAATAG GGCATTGGCA GGCATTATTG AACGGACATG TGATTGCTGA ATCTGACTTT GACTCCACAA ATGACTCATC TGGCACAAAA TCCATTACTA TAGACACCGG CGACAAGCTA TACAACGAGC TTATTTTCAC AGCTAAAGAA TATAGTGAAG GAGGGGGAAA TCTTGGTGTC AATTCAGACA GCTCTGATTA CTACATTAAA TCCATTTATT CCTCCGGCCC ATGTGACTGT AATGGCCCCT TAGCCACAAC CGAAGACGCT CCGACTTCAT CTTTAACTAC ACACATTCTT TCCAACGACT CTGATCCTGA AAATAATACG CTTTTTATCA CTAACATCGA TACCAATCTA ACGAACGGCA AGGTTTCCTT CGATGTCGAC ACCGAAGGAA ATATAACCAA TGTCGTCTAT GATCCTGATA ATTATTACAA CTACTTAAAT CCTGGCGAAA CTGCAACTGA CACATTTTCA TACACTATCA GCGACGGATA TGGGGGCACT GACACAGCGC AGGTTACAAT CAAGATCATT GGTATCAACG ACTACCCCTC AGCTGAAAAC GATATGCAAT CTCTCAACTA TGGCCACCTC CTTGCTACAA ACAATACAGG GGCTACGATA CTCTCCTCAG TCAATCTAGA CCCTGATGCA GAGACAACAT ATACAGCTAT AAACGAGGAC ATAGGCATCA AACCCGATGA AATGGCCTAT GATGGATCTG GATTACTTTG GGCTTTCGAC AATTCCAGCA AAAACTTTTA CACCATCAAC CCGTCTAATG GCGATATAAA ATTCCAATAC GAAGGAGTCA TACAATCTGA TGTCGACGGC ATATCTTTCC TTACTATAAA TGGCTCCGAG TATATGTTTG TTCTTGCAGG AAAAGATCTG TACACTCTCA ATCCTGACAA TGGCAACACG ATAATTCCAG AAAGCTACTC CATAGAAGGA GCTTCTAGTA ATCTTGCAGA CTTAGTTTCT CTCAACGGTA AATTATACAC CTATGCTTCT AATAACAATC TTTTTGAAAT ATCGCTTAAT CCAGACGGAA GTGTTGATTC AGTAAACACC TTTTCTTTAC CGAGTACTAT ATCCAGAGTC GACGGAATGG TTGGCGGTGA CGACGGTAGC CTATACCTTG TTTCCTCAAA AGGTCAGAGT GGTGGGACAG TGACCCCTCT AACCATAGAT AATGATGGAA ATATCACCAT TGAAAGTCCT TTCGATATAG ATGGGGCGCC ACTTGGCAAT CTCTGGGCCA TGGCTGGGAT GATAAATTTC AACACAGAAG TTAATGGAAA CGTTTTAGAC AACGACACCG ATCCGGAAAA CTCATTGTTG GAAGTCACTC TGGTAGACGG CCAAAGCGCG AACGTCGGTA CCACTGTTGA TGGTGACTTC GGCTCGTTGA CCCTGAACAG CGACGGGACG TACACCTACA CACTGGACAC GGATATAACC GCAGCCGGCC AGGACAGCTT CACGTATACC ATTTCTGACG GATACGGAGG ACTTTCCAGT GCCACGCTGA CCTTCAACGT CAACGGTTTT ATCCCAACTG GTTCCGGCTC GGTAATCACC GGGGACGAAA ACGACAACAT CCTTACCGGC ACAGTGAACA ATGACATCCT GTTTGGAGAT GAAGGAAACG ACAGCCTCAC CGGCGACGAT GGCGCTGACA CCTTTGTCTA CAGTGCTGAT GGCGGCGAGG GTCAGGACAC GATCCTCGAT TTCAATCCTG GAGAGGATAT CATCCGCCTG ACCGATGTCC TGGATAGCGA TACGGACGGA CTGCCGGATC TCAATGAACT GGCTGGATCA GCGCAAGAAG TAAGTGTTGC GGTCAATGGC AGTGATGTCA CCCTGACCAT CGCGGGAACA AATGGCAACA TGGACAGCAC CGTCACCCTG GACGGGATCA ACTCGGGAGC CTACGACTCG TACGACGGCG GAACGCTCCA GGACCTCATC GACAACGACC TGATCAAGGT CCAATACGAA TCCGGTTCTT TTGACAGCTA A
|
Protein sequence | MAENTMNVAQ MEGQIEEGGR ETSNAAAENL QTSQQTQQPI PVPEAGETRV VQVAPGDVIQ LGADVTEAEL IQQGGSVLLR FSDGGDLLLE DFINQTDAGE PPTLILADGS EVTADQIIAA MTPAPDAAPA AGEGPTSGGA GEYREDVGNL IEGVNRLDGL GPDALAADAA LVPEAEGEGI LPEVEEDALP LAADDAFTIE EDPESPLEGD LSLNDDPGDA PATFAILDGP ENGTATVNPD GTFSYTPSDN YNGPDSFTYT ITDSDGDTDT ATVTIEVTPV NDAPQALDNE YSTDEDSSVG GNVITDTGFD DELNVTGVDS DPENDPLTVT AVNGTAIESG DTITLASGAL LTMNSDGTFT YNPNGQFEGL GGEGSENSAG SDTFTYTITD GDATSTADVT INVSGLNDAP QALDNQYSTN EDSSVGGNVI TDTGFDDELN TTGVDSDPEN DPLTVTAVNG TAINSGDTIT LASGALLTMN SDGTFTYDPN GQFEGLGGEG SENSSGSDTF TYTITDGDAT STADVTINVS GVNDAPQALD NEYNTNEDSS VGGNVITDSG FDDELNVTGV DSDPENDPLT VTAVNGTAIE SGDTITLASG ALLTMNSDGT FTYDPNGQFE GLGGEGSENS SGSDTFTYTI TDGDATSTAD VTINVSGVND GPQALDNQYS TDEDSSVDGN VITDIGVDDE LNTTGVDSDP ENDPLTVTAV NGTAIESGDT ITLASGALLT MNSDGTFTYN PNGQFEGLGG EGSENSSGSD TFTYTITDGD ATSTADVTIN VSGVNDAPQA LDNEYSTDED SSVGGNVITD TGFDDELNVT GVDSDPENDP LTVTAVNGTA IESGDTITLA SGALLTMNSD GTFTYDPNGQ FEGLGGEGSE NSAGSDTFTY TITDGNATST ADVTINVSGV NDPPIAVDDV PCPESEDFTA SLFLNEGQNG SSASNWSMTN LSIIAYEFNG DNESDTLAYT QQGVGVLSDG ENNPPNRFNE LDYNRGEDKS ESLQIKFDGF VNTSTVTLGM FFENEGPNNG EEIGHWQALL NGHVIAESDF DSTNDSSGTK SITIDTGDKL YNELIFTAKE YSEGGGNLGV NSDSSDYYIK SIYSSGPCDC NGPLATTEDA PTSSLTTHIL SNDSDPENNT LFITNIDTNL TNGKVSFDVD TEGNITNVVY DPDNYYNYLN PGETATDTFS YTISDGYGGT DTAQVTIKII GINDYPSAEN DMQSLNYGHL LATNNTGATI LSSVNLDPDA ETTYTAINED IGIKPDEMAY DGSGLLWAFD NSSKNFYTIN PSNGDIKFQY EGVIQSDVDG ISFLTINGSE YMFVLAGKDL YTLNPDNGNT IIPESYSIEG ASSNLADLVS LNGKLYTYAS NNNLFEISLN PDGSVDSVNT FSLPSTISRV DGMVGGDDGS LYLVSSKGQS GGTVTPLTID NDGNITIESP FDIDGAPLGN LWAMAGMINF NTEVNGNVLD NDTDPENSLL EVTLVDGQSA NVGTTVDGDF GSLTLNSDGT YTYTLDTDIT AAGQDSFTYT ISDGYGGLSS ATLTFNVNGF IPTGSGSVIT GDENDNILTG TVNNDILFGD EGNDSLTGDD GADTFVYSAD GGEGQDTILD FNPGEDIIRL TDVLDSDTDG LPDLNELAGS AQEVSVAVNG SDVTLTIAGT NGNMDSTVTL DGINSGAYDS YDGGTLQDLI DNDLIKVQYE SGSFDS
|
| |