Gene Daud_2108 details


Gene Information

Locus tag: Daud_2108
Symbol:
ID: 6027646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 2226316
End bp: 2229303
Gene Length: 2988 bp
Protein Length: 995 aa
Translation table: 11
GC content: 67%
IMG OID: 641594928
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001718229
Protein GI: 169832247
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
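
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(2226316, 2229303)
print(length_bp, protein_length(length_bp))  # prints: 2988 995
```

Both values match the Gene Length and Protein Length fields of this record.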


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.557195
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTACC AAGGAGCGCG CCGGCGGTGG CTGGTGCTGG TGATCACCGC GATCTTTGTT 
TTAAGCCTGG CCCTGCCAGC CGCGGCCGCC GGGCCGGACC GGGTGAAGCC CGGAGCGGGC
GGGGAACCGC CGGGCGACTT TGTGCCCGGA GAGGTAATCG TCAAGTTCAA GGAGGGTGTG
CGGGCGGCCG CGACGATGCA GACCCTGGCG GCCAAGCACC GGGCGTTCGG GCTGGCCGCG
GTGCGGGTGC TGCCCTACGA GGCCGCGCTG TTCACCACCA CGACCGATGT AACGGCGGCG
GTGGCCGCCC TGCAGCGCGA CCCGCGGGTC GAGTTCGCCC AGCCGAACTA CATCTACCGC
GCCCTTGGCG CCCCCGACGA TCCGCTGTGG GACCAGCAGT GGGGGATGCA TGCCTCTGAC
GGGCCGCCCC CGCACCACCC TCACGGCGTA CGGGCGCTGG AGGCCTGGAC ACACACCAAG
GGCTCGGCCG ACATTGTGGT CGCCGTGATC GACACCGGAA TCGACTACAC TCACGAGGAC
TTAGCGGCCA ACATGTGGAC CAACCCGGGT GAAATCCTCG GGGATAGAAT CGACAACGAC
GGTAACAGCT TTGTGGACGA CTACTATGGT TATGATTTCA TCGGAGCAAA TGCCCGCAAC
CCCCAACCGG ACTCCGACCC GCTGGACGAT GACGGTCACG GCACCCACGT GGCCGGGATT
GTAGCCGCCA CGGCGAACAA CGCCAAGGGC ATCGCCGGCA CCGCCCCGGG TGTGCGGCTC
ATGGCGGTGA AGGCGCTTGA CTCCGGAGGC TTCGGCACCA CTGCCGCCAT CGTAAACGCT
ATTAATTACG CGGCCACTAA CGGGGCGCAG GTGGTGAACA TGAGCTTTGG TGGGACAGGG
TTTGACCCCC TGCAGTATAA AGCCATCGCC GCGCACCCGG GGGTACTCTT CGTAGCTGCG
GCCGGGAACG GCGGATCTGA TGGCATTGGT GACAACAACG ACACTAACCC CGTCTCCCCG
GCCAGCTTCA CCATTGATTG GAACATTGAT ACAAATGACG ATGGAACCTC TGAACACTTC
CCGGCCCTGC CCCACCTCAT AAGCGTAGCC GCCCTGGCTC CGAACGGCAA CCTGACCACC
TTCTCCAACT TCGGCGCCAC CTCGGTTGAC CTGGCCGCGC CGGGCGACGC GATCGTGAGT
ACGGTGCCGC AGTGGGACGG CACCCCTCCC TCTCCCTACG CCGCTTGGGA CGGCACCTCG
ATGGCCGCGC CGTTTGTCGC CGCCGGCGGG GCGTTGGTCC TTTCGCTGCG CCCCGACCTG
GCACCCGCGA GCGTGATCGA CCTGCTCAAC AACAACGTCA CCGAGTTGGC TTCCGCCCTG
ACCGGTAAAG TGGCCTCGGG CGGCACCCTC AACCTGGCCC GAGCCCTGGC CGCCGTCCCG
CCGGGCGTGA AAAGCACTGT ACCGGCGCAT GGTGCCACCG GGGTAGCGGT CAACACCAAC
ATCACCGTCA CCTTCAGCGA AAGCGTGACC AAAGGGGTCT ACTTCGACGG CATCACCATC
AGCGGTGGCG GGACAACGGT GAGCCACACC TACGGCCTCA GCGGCAGCAT GCTCACCTTG
AACCCGGACG CCAACCTGGC CCACAGTACG GTTTATACCG TCACGATCCC GGCCGGGGCG
GTCCAAGACG CCGCCGGCAA CCCGTCAGAC GCCCACAGCT TCAGCTTCAC TACCCAGGCC
GCAGGCGGCG GAGGCGGCGG CGGCGGTGGA GCGCCCGCCC CTCCGGCACC TCCGGAAGCA
CCGGGACCAC CGGCCGGCAC TGGTGAATTC ACCGCCACCG GCGGGGCGCA AAGTGTGAGT
CTCCTGGACG GCCAAGTAAC CCTGGACCTC CCGGCCGGCG CCCTGCCCGA AGGGGCGAAG
GTCACCGTCA CGCTGGCCGC CGACACCCCG GAGAATCTCC CGGCCGGCGC CAAAGCGGTC
AGCGCGGTGT TCAGCTTTAA GAGTACCGCA CCTCTGGCCA AACCGGTCCG GGTCTCCATC
CGGTACGAGG CGGACAAACT GGGCGGCCTC GACCCGCAGG CGCTGATGGT CTTCCGGGAG
AACCCGGACG GCACCTGGCA AAGAGTGGGT GGCAAACTCG ACCGCGCCGC CCAGGCGGTT
GTGGTCGAGC TCGACGGCTT CTCCAGCTAC ACCGTCCTCG GTACGCCGAA GACCTTCGGG
GACATCAAGG GCCACTGGGC GCAAGCCGAC ATCGAACTGC TGGCGGCCCG CGGCCTGGTC
CAGGGCCGGG CGGCCGGTAA GTTCGCCCCC GGAGCGCCGG TGACCCGGGC CGAGATGGCC
GCGCTCTTAG TACGGCTGAC CGGTGCAAAG GAAGTAACTC CGGCGCAGCC GGCCTTCACC
GACGTAGCCC CCGGTGCCTG GTACTACAGT GCAATCGAAA CGGCTGTCCG GGCCGGACTG
TTCAAGGGCT ACGCCGACGG CAGCTTCCAG CCCGACGCCA CCCTAACCCG CGAGCAACTG
GCGGCGCTGG CCGTACGCCT TACCGGAGCT GCGACCGGCA CGACCCAACT ACCCTTCGCC
GACCGGGCCG CCATCGCCCC CTGGGCCGAG GAAGCGGTCG CCGCCGCCTA CGCCCAAGGG
CTGCTGCGCG GCGTCTCCGA CACCGAGTTT GCCCCGCAAA TGTCGGTGAC CCGGGCCCAG
GCCGCGACCA TCATGGTCCG GCTGGCCGAA AGGAAGGGGC TGTTCGAGGT AACGATCACG
GCTACCGGCA CCCTGGTGTG GAACACGCTG GTCGGCGGCT TCTGGGAACT GGCCGCCGAC
CAGGAAACCT ACGTGCTCCT GCCCGACCCG CGGCACAAAG CGGCCGCGGC CCAACTGAAG
CAGTTCGAGA ACCAGGAGAT CACCGTGACC GGCTACATTC AGACCGGACC GAACATCTAC
ATGCGCGGCC CGCTGCTCCG CATCCTGAAT GTTACCCCAA CCGGGTAA
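
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C among the A, C, G and T bases; how the database rounds or handles ambiguous bases is not stated in this record. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters of `seq`.

    Spaces and line breaks from the 10-base block layout are ignored,
    as is any character other than A, C, G or T.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGATTTACC AAGGAGCGCG"))  # prints: 0.5
```

Passing the full gene sequence block to the function gives the value for the whole gene.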
 
Protein sequence
MIYQGARRRW LVLVITAIFV LSLALPAAAA GPDRVKPGAG GEPPGDFVPG EVIVKFKEGV 
RAAATMQTLA AKHRAFGLAA VRVLPYEAAL FTTTTDVTAA VAALQRDPRV EFAQPNYIYR
ALGAPDDPLW DQQWGMHASD GPPPHHPHGV RALEAWTHTK GSADIVVAVI DTGIDYTHED
LAANMWTNPG EILGDRIDND GNSFVDDYYG YDFIGANARN PQPDSDPLDD DGHGTHVAGI
VAATANNAKG IAGTAPGVRL MAVKALDSGG FGTTAAIVNA INYAATNGAQ VVNMSFGGTG
FDPLQYKAIA AHPGVLFVAA AGNGGSDGIG DNNDTNPVSP ASFTIDWNID TNDDGTSEHF
PALPHLISVA ALAPNGNLTT FSNFGATSVD LAAPGDAIVS TVPQWDGTPP SPYAAWDGTS
MAAPFVAAGG ALVLSLRPDL APASVIDLLN NNVTELASAL TGKVASGGTL NLARALAAVP
PGVKSTVPAH GATGVAVNTN ITVTFSESVT KGVYFDGITI SGGGTTVSHT YGLSGSMLTL
NPDANLAHST VYTVTIPAGA VQDAAGNPSD AHSFSFTTQA AGGGGGGGGG APAPPAPPEA
PGPPAGTGEF TATGGAQSVS LLDGQVTLDL PAGALPEGAK VTVTLAADTP ENLPAGAKAV
SAVFSFKSTA PLAKPVRVSI RYEADKLGGL DPQALMVFRE NPDGTWQRVG GKLDRAAQAV
VVELDGFSSY TVLGTPKTFG DIKGHWAQAD IELLAARGLV QGRAAGKFAP GAPVTRAEMA
ALLVRLTGAK EVTPAQPAFT DVAPGAWYYS AIETAVRAGL FKGYADGSFQ PDATLTREQL
AALAVRLTGA ATGTTQLPFA DRAAIAPWAE EAVAAAYAQG LLRGVSDTEF APQMSVTRAQ
AATIMVRLAE RKGLFEVTIT ATGTLVWNTL VGGFWELAAD QETYVLLPDP RHKAAAAQLK
QFENQEITVT GYIQTGPNIY MRGPLLRILN VTPTG
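
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon-to-amino-acid mapping shared by the standard genetic code and translation table 11; the two differ in which codons may act as start codons, which does not matter here because this gene starts with ATG. The function name `translate` is illustrative, and the sketch raises `KeyError` on any codon containing characters other than A, C, G or T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and newlines
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three blocks and the final two blocks of the gene sequence above.
print(translate("ATGATTTACC AAGGAGCGCG CCGGCGGTGG"))  # prints: MIYQGARRRW
print(translate("GTTACCCCAA CCGGGTAA"))               # prints: VTPTG*
```

The two results agree with the first ten and last five residues of the protein sequence, followed by the stop codon.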