Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daud_0199 |
Symbol | |
ID | 6027656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 227092 |
End bp | 230157 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641593054 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001716393 |
Protein GI | 169830411 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAATC AATTCTTTGA AAAGCCAATC CTTAACTCTC CCTATGAGTA TCCGGCGCGG CACTGGGAAC TTGATGATCA GGGCCAGCCT ACGCAACGAA TCATCGACAG GCGCCGTCGC GCCGAGTTCA TCACGCCGAT TCCGAAGCCC AGGAAACGCA AGGATTCACT GGACCAGCAA CAGATGGTTT TCGATGAAGG CAAAGGGCTT TCGACGAAAG CTCAGCAGTA TGATCCTACG TCCATTATCA ATGAGATTCG ACGCCATGTA GACCAGTGGC GCAGCAGTTC CCCCGGTGAT TGGCGCGTCA CGCCCGAAAC CGCCCGGCTG CTTCACCACT GGCGACATCA TAAGTTCAGC AACATCCAGC CGTTTTTCTG CCAGGTCGAA GCGGTTGAGA CGGTCATCTG GTTGACGGAA GTGGCTCCGA AAATCGGCAA GATCGGCCAG CCATTCCTTG AACACCTTAT CAATGCTAAC AATGAGGCCA ACCCGGGCCT GCTGCGTCTG GCCCTGAAGC TGGCTACCGG TGCCGGCAAA ACCACAGTAA TGGCCATGCT GATTGCCTGG CAGACCATCA ATGCCGTGCG CCAGCCCAAT AGCAAACGAT TCACCCGGGG GTTCTTGGTT GTCTGCCCGG GTCTGACTAT CCGCGACCGG CTCCGCGTGC TTCAGCCCAA TGATCCGGAC AGCTACTACC AGAGCCGGGA ACTCGTCCCC AACGATATGC TCCGCGATCT GGAACGGGCT AAGATCGTCA TCACCAACTA CCACGCCTTC AAACTCCGCG AACGCATGGA GCTGTCCAAG GGCGGCCGGT CGCTGCTTCA GGGCCGGGGC GCGGCGCTCA ACACGTTGGA AACCGAGGGA CAGATGCTCC AGCGGGTGAT GCCCGAGCTG ATGGGTATGA AGAACATCCT GGTGCTCAAC GACGAGGCGC ATCATTGTTA CCGTGAAAAG CCCAAAAGCG ATGCCGAGGG CGAGCTGAAA GGCGATGACC GGAGGGAAGC CGAGAAAAAC AACGAAGCTG CCCGTGTTTG GATCTCCGGC CTCGAAACCG TCAACCGCAA ACTCGGTATC ACGCGCGTCA TTGACCTATC GGCAACGCCG TTCTTCCTCC GGGGCTCCGG CTACGCTGAG GGTACGCTGT TCCCCTGGAC GGTGAGTGAC TTCTCGCTGA TGGACGCCAT CGAATGCGGC ATCGTCAAAC TGCCGCGCGT TCCTGTGGCC GATAACATCC CTGGCGGGGA GATGCCCAAG TTCCGCAACC TCTGGGAGCA CATTCGCACG CGGATGCCTA AGAAAGGCCG GGGTAAGGCA AAGAGCCTTG ATCCGCTGAG CCTGCCGGTC GAGCTGCAAA CTGCGCTCGA TGCCCTTTAT GGGCATTACG AAAAGACATA TGAACTATGG CAGAAAAGCG GTATTAGAGT CCCGCCCTGC TTCATTGTGG TCTGCAACAA TACGTCCACC TCCAAACTGG TGTACGACTA TATTTCCGGC TTCTACCGGG AGAACGAAGA CGGTTCGACC ACCCTTGAGA ACGGACGCCT GGCACTCTTT CGGAACTTCG ACGAGCACGG CAACCCACTC CCCCGTCCAC GGACGCTGTT GATAGACAGT GAGCAGCTCG AATCCGGCGA AGCACTGGAC GATAACTTCC GCGCTATGGC TGCCGATGCG ATCGAGCGTT TCCGACGCGA GATCATAGAA CGTACCGGCG ACCGCCGCCA GGCCGAGAAC CTGACAGATC AGGAGTTGCT GCGGGAAGTC ATGAATACCG TTGGTAAGGA AGGTCGCCTC GGCGAGTCGA TTCGCTGCGT GGTATCGGTT TCCATGCTTA CTGAGGGCTG GGATGCTAAC ACCGTTACGC ATGTGCTGGG CGTGCGTGCC TTTGGCACTC AGCTCTTGTG CGAGCAGGTC ATCGGCCGTG CGTTGCGCCG CCAGTCCTAT GACCTCAATG AGGACGGCTT GTTTAACGTA GAATATGCTG ACATACTGGG GATACCGTTC GACTTCACCG CCAAGCCGGT TATTGCGCCC CCGCAACCGC CCCGTGAGAC CATCCAGGTC AAGGCTGTGC GACCCGACCG CGACCACCTC GAGATCCGTT TTCCGCGCGT CGAAGGTTAC CGTGTCGAAC TGCCCGAGGA AAGGCTTACC GCCAAATTCA ACGACGACTC CATTCTTGAG CTGACCCCAG ATATCGTCGG CCCCTCGATC ACCAAGAACG CGGGGATTAT TGGTGAAGAC GTCGACCTCA GCCTGCAGCA TCTGGAAGAT ATACGGCGGT CCACGCTGCT GTTTCACCTT ACCAAACGTT TGCTGTACAC CAAGTGGCGG GACCACGGGG AAGAGCCCAG ACTCCACCTG TTCGGGCAGC TTAAGCGAAT CACCAGGCAG TGGCTCGATA ACTATTTGGT CTGCAAGGGC GGCACCTTCC CCGCGCAACT AATGTATCAG GAGTTGGCGG ACATGGCTTG TGAGCGCATA ACCGCCGGCA TCACCCGTTC GTTGGTGGGT GAACGGCCCA TTAAGGCTAT ACTGGACCCC TATAACCCCA CTGGCTCCAC TATCCACGTG AACTTCAATA CCTCGAAGAA AAACCGCTGG GAGACCGACC CACGCCGCTG TCACATCAAT TGGGTCATCC TCGACAGCGA CTGGGAGGCC GAGTTCTGCC GCGTTGCTGA GTCCCATCCA CGGGTCAAGG CTTATGTCAA GAACCACAAC CTCGGACTGG AAGTACCGTA CCGCTACGGT TCGGAGGTAC GGAAGTACAT CCCCGACTTC ATCGTCCTAG TCGACGGCGG GCACGGCGAG GACAACCTGC TCCACCTGGT CGTCGAGATC AAGGGGTACC GGCGCGAGGA CGCCAAGGAG AAGAAGACCG CCATGGAGAC CTACTGGATA CCCGGGGTCA ATAAGCTCAA ACAGTATGGC CGCTGGGCGT TTGCCGAGTT CACCGAGGTC TACCGCATCG AGGCCGACTT TGAGGCCAGG GTCGAAGCCG AGTTCAACAA GATGATCGAT TCAGTCACCA CTCGGCCGAC GGTGGAGGGG AGTTAA
|
Protein sequence | MDNQFFEKPI LNSPYEYPAR HWELDDQGQP TQRIIDRRRR AEFITPIPKP RKRKDSLDQQ QMVFDEGKGL STKAQQYDPT SIINEIRRHV DQWRSSSPGD WRVTPETARL LHHWRHHKFS NIQPFFCQVE AVETVIWLTE VAPKIGKIGQ PFLEHLINAN NEANPGLLRL ALKLATGAGK TTVMAMLIAW QTINAVRQPN SKRFTRGFLV VCPGLTIRDR LRVLQPNDPD SYYQSRELVP NDMLRDLERA KIVITNYHAF KLRERMELSK GGRSLLQGRG AALNTLETEG QMLQRVMPEL MGMKNILVLN DEAHHCYREK PKSDAEGELK GDDRREAEKN NEAARVWISG LETVNRKLGI TRVIDLSATP FFLRGSGYAE GTLFPWTVSD FSLMDAIECG IVKLPRVPVA DNIPGGEMPK FRNLWEHIRT RMPKKGRGKA KSLDPLSLPV ELQTALDALY GHYEKTYELW QKSGIRVPPC FIVVCNNTST SKLVYDYISG FYRENEDGST TLENGRLALF RNFDEHGNPL PRPRTLLIDS EQLESGEALD DNFRAMAADA IERFRREIIE RTGDRRQAEN LTDQELLREV MNTVGKEGRL GESIRCVVSV SMLTEGWDAN TVTHVLGVRA FGTQLLCEQV IGRALRRQSY DLNEDGLFNV EYADILGIPF DFTAKPVIAP PQPPRETIQV KAVRPDRDHL EIRFPRVEGY RVELPEERLT AKFNDDSILE LTPDIVGPSI TKNAGIIGED VDLSLQHLED IRRSTLLFHL TKRLLYTKWR DHGEEPRLHL FGQLKRITRQ WLDNYLVCKG GTFPAQLMYQ ELADMACERI TAGITRSLVG ERPIKAILDP YNPTGSTIHV NFNTSKKNRW ETDPRRCHIN WVILDSDWEA EFCRVAESHP RVKAYVKNHN LGLEVPYRYG SEVRKYIPDF IVLVDGGHGE DNLLHLVVEI KGYRREDAKE KKTAMETYWI PGVNKLKQYG RWAFAEFTEV YRIEADFEAR VEAEFNKMID SVTTRPTVEG S
|
| |