Gene Daud_0199 details

Gene Information

Locus tag: Daud_0199
Symbol:
ID: 6027656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 227092
End bp: 230157
Gene Length: 3066 bp
Protein Length: 1021 aa
Translation table: 11
GC content: 58%
IMG OID: 641593054
Product: type III restriction enzyme, res subunit
Protein accession: YP_001716393
Protein GI: 169830411
COG category:
COG ID:
TIGRFAM ID:
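
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 227092
END_BP = 230157
GENE_LENGTH_BP = 3066
PROTEIN_LENGTH_AA = 1021


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 230157 - 227092 + 1 = 3066 bp, and 3066 / 3 - 1 = 1021 aa.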


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAATC AATTCTTTGA AAAGCCAATC CTTAACTCTC CCTATGAGTA TCCGGCGCGG 
CACTGGGAAC TTGATGATCA GGGCCAGCCT ACGCAACGAA TCATCGACAG GCGCCGTCGC
GCCGAGTTCA TCACGCCGAT TCCGAAGCCC AGGAAACGCA AGGATTCACT GGACCAGCAA
CAGATGGTTT TCGATGAAGG CAAAGGGCTT TCGACGAAAG CTCAGCAGTA TGATCCTACG
TCCATTATCA ATGAGATTCG ACGCCATGTA GACCAGTGGC GCAGCAGTTC CCCCGGTGAT
TGGCGCGTCA CGCCCGAAAC CGCCCGGCTG CTTCACCACT GGCGACATCA TAAGTTCAGC
AACATCCAGC CGTTTTTCTG CCAGGTCGAA GCGGTTGAGA CGGTCATCTG GTTGACGGAA
GTGGCTCCGA AAATCGGCAA GATCGGCCAG CCATTCCTTG AACACCTTAT CAATGCTAAC
AATGAGGCCA ACCCGGGCCT GCTGCGTCTG GCCCTGAAGC TGGCTACCGG TGCCGGCAAA
ACCACAGTAA TGGCCATGCT GATTGCCTGG CAGACCATCA ATGCCGTGCG CCAGCCCAAT
AGCAAACGAT TCACCCGGGG GTTCTTGGTT GTCTGCCCGG GTCTGACTAT CCGCGACCGG
CTCCGCGTGC TTCAGCCCAA TGATCCGGAC AGCTACTACC AGAGCCGGGA ACTCGTCCCC
AACGATATGC TCCGCGATCT GGAACGGGCT AAGATCGTCA TCACCAACTA CCACGCCTTC
AAACTCCGCG AACGCATGGA GCTGTCCAAG GGCGGCCGGT CGCTGCTTCA GGGCCGGGGC
GCGGCGCTCA ACACGTTGGA AACCGAGGGA CAGATGCTCC AGCGGGTGAT GCCCGAGCTG
ATGGGTATGA AGAACATCCT GGTGCTCAAC GACGAGGCGC ATCATTGTTA CCGTGAAAAG
CCCAAAAGCG ATGCCGAGGG CGAGCTGAAA GGCGATGACC GGAGGGAAGC CGAGAAAAAC
AACGAAGCTG CCCGTGTTTG GATCTCCGGC CTCGAAACCG TCAACCGCAA ACTCGGTATC
ACGCGCGTCA TTGACCTATC GGCAACGCCG TTCTTCCTCC GGGGCTCCGG CTACGCTGAG
GGTACGCTGT TCCCCTGGAC GGTGAGTGAC TTCTCGCTGA TGGACGCCAT CGAATGCGGC
ATCGTCAAAC TGCCGCGCGT TCCTGTGGCC GATAACATCC CTGGCGGGGA GATGCCCAAG
TTCCGCAACC TCTGGGAGCA CATTCGCACG CGGATGCCTA AGAAAGGCCG GGGTAAGGCA
AAGAGCCTTG ATCCGCTGAG CCTGCCGGTC GAGCTGCAAA CTGCGCTCGA TGCCCTTTAT
GGGCATTACG AAAAGACATA TGAACTATGG CAGAAAAGCG GTATTAGAGT CCCGCCCTGC
TTCATTGTGG TCTGCAACAA TACGTCCACC TCCAAACTGG TGTACGACTA TATTTCCGGC
TTCTACCGGG AGAACGAAGA CGGTTCGACC ACCCTTGAGA ACGGACGCCT GGCACTCTTT
CGGAACTTCG ACGAGCACGG CAACCCACTC CCCCGTCCAC GGACGCTGTT GATAGACAGT
GAGCAGCTCG AATCCGGCGA AGCACTGGAC GATAACTTCC GCGCTATGGC TGCCGATGCG
ATCGAGCGTT TCCGACGCGA GATCATAGAA CGTACCGGCG ACCGCCGCCA GGCCGAGAAC
CTGACAGATC AGGAGTTGCT GCGGGAAGTC ATGAATACCG TTGGTAAGGA AGGTCGCCTC
GGCGAGTCGA TTCGCTGCGT GGTATCGGTT TCCATGCTTA CTGAGGGCTG GGATGCTAAC
ACCGTTACGC ATGTGCTGGG CGTGCGTGCC TTTGGCACTC AGCTCTTGTG CGAGCAGGTC
ATCGGCCGTG CGTTGCGCCG CCAGTCCTAT GACCTCAATG AGGACGGCTT GTTTAACGTA
GAATATGCTG ACATACTGGG GATACCGTTC GACTTCACCG CCAAGCCGGT TATTGCGCCC
CCGCAACCGC CCCGTGAGAC CATCCAGGTC AAGGCTGTGC GACCCGACCG CGACCACCTC
GAGATCCGTT TTCCGCGCGT CGAAGGTTAC CGTGTCGAAC TGCCCGAGGA AAGGCTTACC
GCCAAATTCA ACGACGACTC CATTCTTGAG CTGACCCCAG ATATCGTCGG CCCCTCGATC
ACCAAGAACG CGGGGATTAT TGGTGAAGAC GTCGACCTCA GCCTGCAGCA TCTGGAAGAT
ATACGGCGGT CCACGCTGCT GTTTCACCTT ACCAAACGTT TGCTGTACAC CAAGTGGCGG
GACCACGGGG AAGAGCCCAG ACTCCACCTG TTCGGGCAGC TTAAGCGAAT CACCAGGCAG
TGGCTCGATA ACTATTTGGT CTGCAAGGGC GGCACCTTCC CCGCGCAACT AATGTATCAG
GAGTTGGCGG ACATGGCTTG TGAGCGCATA ACCGCCGGCA TCACCCGTTC GTTGGTGGGT
GAACGGCCCA TTAAGGCTAT ACTGGACCCC TATAACCCCA CTGGCTCCAC TATCCACGTG
AACTTCAATA CCTCGAAGAA AAACCGCTGG GAGACCGACC CACGCCGCTG TCACATCAAT
TGGGTCATCC TCGACAGCGA CTGGGAGGCC GAGTTCTGCC GCGTTGCTGA GTCCCATCCA
CGGGTCAAGG CTTATGTCAA GAACCACAAC CTCGGACTGG AAGTACCGTA CCGCTACGGT
TCGGAGGTAC GGAAGTACAT CCCCGACTTC ATCGTCCTAG TCGACGGCGG GCACGGCGAG
GACAACCTGC TCCACCTGGT CGTCGAGATC AAGGGGTACC GGCGCGAGGA CGCCAAGGAG
AAGAAGACCG CCATGGAGAC CTACTGGATA CCCGGGGTCA ATAAGCTCAA ACAGTATGGC
CGCTGGGCGT TTGCCGAGTT CACCGAGGTC TACCGCATCG AGGCCGACTT TGAGGCCAGG
GTCGAAGCCG AGTTCAACAA GATGATCGAT TCAGTCACCA CTCGGCCGAC GGTGGAGGGG
AGTTAA
 
Protein sequence
MDNQFFEKPI LNSPYEYPAR HWELDDQGQP TQRIIDRRRR AEFITPIPKP RKRKDSLDQQ 
QMVFDEGKGL STKAQQYDPT SIINEIRRHV DQWRSSSPGD WRVTPETARL LHHWRHHKFS
NIQPFFCQVE AVETVIWLTE VAPKIGKIGQ PFLEHLINAN NEANPGLLRL ALKLATGAGK
TTVMAMLIAW QTINAVRQPN SKRFTRGFLV VCPGLTIRDR LRVLQPNDPD SYYQSRELVP
NDMLRDLERA KIVITNYHAF KLRERMELSK GGRSLLQGRG AALNTLETEG QMLQRVMPEL
MGMKNILVLN DEAHHCYREK PKSDAEGELK GDDRREAEKN NEAARVWISG LETVNRKLGI
TRVIDLSATP FFLRGSGYAE GTLFPWTVSD FSLMDAIECG IVKLPRVPVA DNIPGGEMPK
FRNLWEHIRT RMPKKGRGKA KSLDPLSLPV ELQTALDALY GHYEKTYELW QKSGIRVPPC
FIVVCNNTST SKLVYDYISG FYRENEDGST TLENGRLALF RNFDEHGNPL PRPRTLLIDS
EQLESGEALD DNFRAMAADA IERFRREIIE RTGDRRQAEN LTDQELLREV MNTVGKEGRL
GESIRCVVSV SMLTEGWDAN TVTHVLGVRA FGTQLLCEQV IGRALRRQSY DLNEDGLFNV
EYADILGIPF DFTAKPVIAP PQPPRETIQV KAVRPDRDHL EIRFPRVEGY RVELPEERLT
AKFNDDSILE LTPDIVGPSI TKNAGIIGED VDLSLQHLED IRRSTLLFHL TKRLLYTKWR
DHGEEPRLHL FGQLKRITRQ WLDNYLVCKG GTFPAQLMYQ ELADMACERI TAGITRSLVG
ERPIKAILDP YNPTGSTIHV NFNTSKKNRW ETDPRRCHIN WVILDSDWEA EFCRVAESHP
RVKAYVKNHN LGLEVPYRYG SEVRKYIPDF IVLVDGGHGE DNLLHLVVEI KGYRREDAKE
KKTAMETYWI PGVNKLKQYG RWAFAEFTEV YRIEADFEAR VEAEFNKMID SVTTRPTVEG
S
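
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch, the code below translates the space-separated 10-base blocks used in this record with a hand-built codon table. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
# Codon table built in TCAG order; amino-acid assignments of the standard
# genetic code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a nucleotide string codon by codon, starting at position 0."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
FIRST_LINE = "ATGGATAATC AATTCTTTGA AAAGCCAATC CTTAACTCTC CCTATGAGTA TCCGGCGCGG"
LAST_LINE = "AGTTAA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MDNQFFEKPILNSPYEYPAR
    print(translate(LAST_LINE))   # S
```

The first 60 bases give the first 20 residues of the protein sequence (MDNQFFEKPI LNSPYEYPAR), and the final six bases give the last residue, S, followed by the TAA stop codon.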