Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daud_2108 |
Symbol | |
ID | 6027646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 2226316 |
End bp | 2229303 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641594928 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001718229 |
Protein GI | 169832247 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.557195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTACC AAGGAGCGCG CCGGCGGTGG CTGGTGCTGG TGATCACCGC GATCTTTGTT TTAAGCCTGG CCCTGCCAGC CGCGGCCGCC GGGCCGGACC GGGTGAAGCC CGGAGCGGGC GGGGAACCGC CGGGCGACTT TGTGCCCGGA GAGGTAATCG TCAAGTTCAA GGAGGGTGTG CGGGCGGCCG CGACGATGCA GACCCTGGCG GCCAAGCACC GGGCGTTCGG GCTGGCCGCG GTGCGGGTGC TGCCCTACGA GGCCGCGCTG TTCACCACCA CGACCGATGT AACGGCGGCG GTGGCCGCCC TGCAGCGCGA CCCGCGGGTC GAGTTCGCCC AGCCGAACTA CATCTACCGC GCCCTTGGCG CCCCCGACGA TCCGCTGTGG GACCAGCAGT GGGGGATGCA TGCCTCTGAC GGGCCGCCCC CGCACCACCC TCACGGCGTA CGGGCGCTGG AGGCCTGGAC ACACACCAAG GGCTCGGCCG ACATTGTGGT CGCCGTGATC GACACCGGAA TCGACTACAC TCACGAGGAC TTAGCGGCCA ACATGTGGAC CAACCCGGGT GAAATCCTCG GGGATAGAAT CGACAACGAC GGTAACAGCT TTGTGGACGA CTACTATGGT TATGATTTCA TCGGAGCAAA TGCCCGCAAC CCCCAACCGG ACTCCGACCC GCTGGACGAT GACGGTCACG GCACCCACGT GGCCGGGATT GTAGCCGCCA CGGCGAACAA CGCCAAGGGC ATCGCCGGCA CCGCCCCGGG TGTGCGGCTC ATGGCGGTGA AGGCGCTTGA CTCCGGAGGC TTCGGCACCA CTGCCGCCAT CGTAAACGCT ATTAATTACG CGGCCACTAA CGGGGCGCAG GTGGTGAACA TGAGCTTTGG TGGGACAGGG TTTGACCCCC TGCAGTATAA AGCCATCGCC GCGCACCCGG GGGTACTCTT CGTAGCTGCG GCCGGGAACG GCGGATCTGA TGGCATTGGT GACAACAACG ACACTAACCC CGTCTCCCCG GCCAGCTTCA CCATTGATTG GAACATTGAT ACAAATGACG ATGGAACCTC TGAACACTTC CCGGCCCTGC CCCACCTCAT AAGCGTAGCC GCCCTGGCTC CGAACGGCAA CCTGACCACC TTCTCCAACT TCGGCGCCAC CTCGGTTGAC CTGGCCGCGC CGGGCGACGC GATCGTGAGT ACGGTGCCGC AGTGGGACGG CACCCCTCCC TCTCCCTACG CCGCTTGGGA CGGCACCTCG ATGGCCGCGC CGTTTGTCGC CGCCGGCGGG GCGTTGGTCC TTTCGCTGCG CCCCGACCTG GCACCCGCGA GCGTGATCGA CCTGCTCAAC AACAACGTCA CCGAGTTGGC TTCCGCCCTG ACCGGTAAAG TGGCCTCGGG CGGCACCCTC AACCTGGCCC GAGCCCTGGC CGCCGTCCCG CCGGGCGTGA AAAGCACTGT ACCGGCGCAT GGTGCCACCG GGGTAGCGGT CAACACCAAC ATCACCGTCA CCTTCAGCGA AAGCGTGACC AAAGGGGTCT ACTTCGACGG CATCACCATC AGCGGTGGCG GGACAACGGT GAGCCACACC TACGGCCTCA GCGGCAGCAT GCTCACCTTG AACCCGGACG CCAACCTGGC CCACAGTACG GTTTATACCG TCACGATCCC GGCCGGGGCG GTCCAAGACG CCGCCGGCAA CCCGTCAGAC GCCCACAGCT TCAGCTTCAC TACCCAGGCC GCAGGCGGCG GAGGCGGCGG CGGCGGTGGA GCGCCCGCCC CTCCGGCACC TCCGGAAGCA CCGGGACCAC CGGCCGGCAC TGGTGAATTC ACCGCCACCG GCGGGGCGCA AAGTGTGAGT CTCCTGGACG GCCAAGTAAC CCTGGACCTC CCGGCCGGCG CCCTGCCCGA AGGGGCGAAG GTCACCGTCA CGCTGGCCGC CGACACCCCG GAGAATCTCC CGGCCGGCGC CAAAGCGGTC AGCGCGGTGT TCAGCTTTAA GAGTACCGCA CCTCTGGCCA AACCGGTCCG GGTCTCCATC CGGTACGAGG CGGACAAACT GGGCGGCCTC GACCCGCAGG CGCTGATGGT CTTCCGGGAG AACCCGGACG GCACCTGGCA AAGAGTGGGT GGCAAACTCG ACCGCGCCGC CCAGGCGGTT GTGGTCGAGC TCGACGGCTT CTCCAGCTAC ACCGTCCTCG GTACGCCGAA GACCTTCGGG GACATCAAGG GCCACTGGGC GCAAGCCGAC ATCGAACTGC TGGCGGCCCG CGGCCTGGTC CAGGGCCGGG CGGCCGGTAA GTTCGCCCCC GGAGCGCCGG TGACCCGGGC CGAGATGGCC GCGCTCTTAG TACGGCTGAC CGGTGCAAAG GAAGTAACTC CGGCGCAGCC GGCCTTCACC GACGTAGCCC CCGGTGCCTG GTACTACAGT GCAATCGAAA CGGCTGTCCG GGCCGGACTG TTCAAGGGCT ACGCCGACGG CAGCTTCCAG CCCGACGCCA CCCTAACCCG CGAGCAACTG GCGGCGCTGG CCGTACGCCT TACCGGAGCT GCGACCGGCA CGACCCAACT ACCCTTCGCC GACCGGGCCG CCATCGCCCC CTGGGCCGAG GAAGCGGTCG CCGCCGCCTA CGCCCAAGGG CTGCTGCGCG GCGTCTCCGA CACCGAGTTT GCCCCGCAAA TGTCGGTGAC CCGGGCCCAG GCCGCGACCA TCATGGTCCG GCTGGCCGAA AGGAAGGGGC TGTTCGAGGT AACGATCACG GCTACCGGCA CCCTGGTGTG GAACACGCTG GTCGGCGGCT TCTGGGAACT GGCCGCCGAC CAGGAAACCT ACGTGCTCCT GCCCGACCCG CGGCACAAAG CGGCCGCGGC CCAACTGAAG CAGTTCGAGA ACCAGGAGAT CACCGTGACC GGCTACATTC AGACCGGACC GAACATCTAC ATGCGCGGCC CGCTGCTCCG CATCCTGAAT GTTACCCCAA CCGGGTAA
|
Protein sequence | MIYQGARRRW LVLVITAIFV LSLALPAAAA GPDRVKPGAG GEPPGDFVPG EVIVKFKEGV RAAATMQTLA AKHRAFGLAA VRVLPYEAAL FTTTTDVTAA VAALQRDPRV EFAQPNYIYR ALGAPDDPLW DQQWGMHASD GPPPHHPHGV RALEAWTHTK GSADIVVAVI DTGIDYTHED LAANMWTNPG EILGDRIDND GNSFVDDYYG YDFIGANARN PQPDSDPLDD DGHGTHVAGI VAATANNAKG IAGTAPGVRL MAVKALDSGG FGTTAAIVNA INYAATNGAQ VVNMSFGGTG FDPLQYKAIA AHPGVLFVAA AGNGGSDGIG DNNDTNPVSP ASFTIDWNID TNDDGTSEHF PALPHLISVA ALAPNGNLTT FSNFGATSVD LAAPGDAIVS TVPQWDGTPP SPYAAWDGTS MAAPFVAAGG ALVLSLRPDL APASVIDLLN NNVTELASAL TGKVASGGTL NLARALAAVP PGVKSTVPAH GATGVAVNTN ITVTFSESVT KGVYFDGITI SGGGTTVSHT YGLSGSMLTL NPDANLAHST VYTVTIPAGA VQDAAGNPSD AHSFSFTTQA AGGGGGGGGG APAPPAPPEA PGPPAGTGEF TATGGAQSVS LLDGQVTLDL PAGALPEGAK VTVTLAADTP ENLPAGAKAV SAVFSFKSTA PLAKPVRVSI RYEADKLGGL DPQALMVFRE NPDGTWQRVG GKLDRAAQAV VVELDGFSSY TVLGTPKTFG DIKGHWAQAD IELLAARGLV QGRAAGKFAP GAPVTRAEMA ALLVRLTGAK EVTPAQPAFT DVAPGAWYYS AIETAVRAGL FKGYADGSFQ PDATLTREQL AALAVRLTGA ATGTTQLPFA DRAAIAPWAE EAVAAAYAQG LLRGVSDTEF APQMSVTRAQ AATIMVRLAE RKGLFEVTIT ATGTLVWNTL VGGFWELAAD QETYVLLPDP RHKAAAAQLK QFENQEITVT GYIQTGPNIY MRGPLLRILN VTPTG
|
| |