Gene Daud_0294 details

Gene Information

Locus tag: Daud_0294
Symbol:
ID: 6025503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 316620
End bp: 319460
Gene Length: 2841 bp
Protein Length: 946 aa
Translation table: 11
GC content: 67%
IMG OID: 641593148
Product: excinuclease ABC, A subunit
Protein accession: YP_001716487
Protein GI: 169830505
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
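
The start, end, gene length, and protein length fields above can be cross-checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 316620
end_bp = 319460

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2841 946
```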


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGACA AAATCGTCGT TCGCGGCGCG CGCGTACACA ACCTGAAGAA CATCGTCGTG 
GAGATCCCCC GGAACAAGCT GGTTGTACTG ACCGGTTTGT CCGGAAGCGG GAAGTCTTCG
CTGGCTTTTG ACACCATCTA CGCCGAAGGG CAGCGCCGCT ACGTGGAGTC ACTGTCGGCC
TACGCACGGC AGTTCCTGGG CCAGATGGAC AAACCCGACG TCGACTACAT CGAGGGCCTG
TCCCCGGCCA TCTCCATCGA CCAGAAAACC CGTTCGCACA ACCCCCGGTC CACGGTGGGC
ACCGTGACCG AGATTTACGA CTATCTGCGG CTGCTGTTCG CCCGGGTGGG ACGGCCCCAC
TGCCCGCATT GCGGCCATCC GATCAGTAGC CAGACGGTGG AGCAGATGGT CGACCAGGTG
ATGGCGCTGG GGGAGGGCAC CCGCCTGCAG ATCCTGGCCC CGGTCGTCCG GGGCAAGAAG
GGCGAACACG TGCGGGTGCT GGAGGAGGCC CGGCGCGGCG GGTTTGTCCG GATGCGCATC
GACGGGGAAA TGCGGGAGAT CGGCGAGGAT ATCAGCCTCG ACCGCAACAA GAAGCACACC
ATCGAGATCG TCGTCGACCG GTTGGTGGTC AAGCCGGGCC TGGAGCCCCG GCTGGCCGAC
TCCCTGGAGA CGGCCCTGAA GCAGACCGGC GGCCTGGCCG TCTGCGCTGT CATTGACGGG
CCGGAGATCG TGTTCAGCCA GAATTTCGCC TGCGTGGACT GCGGTTTCAG CTTCAGCGAG
ATCACGCCGC GCCTGTTTTC CTTCAACAGC CCGGTGGGGG CCTGCCCGAC GTGCACCGGC
CTGGGCAGCC GCCTGGAGGT CGACGTCGAC CTGGTTATCC CCGACCGGAA CAAGACCTTG
TACGAAGGCG CCGTGGCCGC CTGGGCCTGG TGGACCCGGG GTTACCACAT CCTTGAGGGC
CTGGCCCGGC ACTACGGGTT CAGCCTGCAG GTCCCGGTGC GGGAGCTTGA CCCGGACCAC
CTGGACATCA TTCTGTACGG CACCGGCGGG ACCCGCATCT CCTTCAGCTA CCGGGACATG
ACCGGGCGCT TGCGCCGCTA TACCGCGCCG TTTGAAGGGG TGGTGTCCTT CCTGAGTCGC
AAGCACCGGG AGACCACCTC CGACCACGTC CGCGAGGAGA CCGAGCGGAT GATGCGCACC
AGGCCCTGTC CGGACTGCGG GGGGCGCCGG TTGAAGCCGG AGGCGCTGGC CGTCAAGGTG
GGCGGAAAAT CGATCGCCGA GGTGGCCGCG CTGTCGATTA CCGCGGCCGA CCGGTTTTTT
GCCGGCTTAA ACCTGACCGC CCGGGAGACG CTGATCGGGC AGCGGGTGCT CAAGGAACTG
CGGGCCCGGC TGGGTTTCCT GGTCAACGTC GGCCTGGACT ACCTGACCCT GGACCGCCAG
GCGGCCACGC TGTCGGGCGG TGAGGCCCAG CGGATCCGGC TGGCCACCCA GATCGGTTCC
GGCCTGGTGG GGGTGCTCTA CATCCTGGAC GAGCCGAGCA TCGGGCTGCA CCCCCGGGAC
AACGGACGGC TCCTGGACAC CCTGAAGCAG CTGCGCGACC TGGGGAACAC CCTGATCGTC
GTCGAACACG ACGAGGAGAC CATCCGGGCC GCCGACCACA TCATCGACAT CGGCCCCGGC
GCCGGGCTGG ACGGCGGCCG GGTGGTGGCG GCGGGCACCC TGCGGGAGAT CACCGGCACG
GACGCTTCCA TCACCGGGCA GTACCTGGCC GGCCGGAAGT TTATCGCGGT GCCCCCGGCG
CGCCGGGCTC CCGGAGAACG GTGGGTCGAG GTCATCGGGG CGCGTGAGCA CAATCTGAAC
AACATCGACG TGCCCTTTCC GCTGGGCGTG TTCACATGCG TCACTGGGGT ATCCGGTTCC
GGGAAGAGCA CCCTGGTGAA CGAAATTCTG AACAAGGCGC TGGCCGCCGC CCTGCACGGC
ACCCGGACCC ACCCGGGGGT GCACGACGAA ATCCGGGGCA CGGAGCATCT GGACAAAGTG
ATCAACGTCG ACCAGTCGCC GATCGGCCGG ACCCCCCGTT CGAACCCGGC CACCTACACC
GGGGTGTTCA CCGATATCCG GGAGCTTTTC GCGCTCACCC CGGAGGCCCG GATGCGCGGT
TACAAGCCCG GCCGGTTCAG CTTCAACGTC CGGGGCGGGC GCTGCGAGGC CTGCCGGGGC
GACGGGATCA TCCGGATCGA GATGCACTTT CTGCCCGACG TCTATGTGCC CTGCGAGGTG
TGCGGGGGAC GCCGGTACAA CCGGGAGACG CTGGACGTCC GGTACAAGGG GCGGACCATC
GCCGACGTCC TGGACATGAC GGTGGACCAG GCCGCCGAGT TCTTCGCGCC GGTCCCGAAG
ATCTACCGCC GCCTGTGCAC CCTGCAGGAC GTGGGCCTGG GCTACATCCG CCTCGGCCAG
CCGGCCACCA CGCTCTCGGG CGGCGAAGCC CAACGGGTGA AGCTGGCCGC CGAACTTTCC
CGGCGGGCGA CCGGACGGAC GCTGTACATC CTGGACGAGC CCACCACCGG GCTGCATTTC
GCGGACATCA AGAAGCTTTT GCAGGTGTTG CAGCGCTTGG TGGACGCCGG GAACACGGTA
ATCGTGATCG AACACAACCT GGACGTGATC AAAACAGCCG ACTACATCAT CGACCTTGGT
CCCGAAGGCG GGGACCGGGG CGGCCGGGTG GTGGCCACCG GCACTCCCGA AGAAGTGGCG
GCCCAGGCTG AATCTTACAC CGGCCGGTTC CTGCGGCGTG TGTTGGACCG GACCCCCGGC
CGCGCCGCCG CCGAAGCCTG A
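
The gene sequence above is listed in space-separated blocks of 10 bases, so it has to be joined before any composition statistic such as GC content is computed. The following is a minimal sketch of one way to do that; `join_blocks` and `gc_fraction` are hypothetical helper names, and the example input is only the first listed line, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C characters in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing above (60 bases).
first_line = "ATGCAGGACA AAATCGTCGT TCGCGGCGCG CGCGTACACA ACCTGAAGAA CATCGTCGTG"
seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the whole listing to `join_blocks` and then to `gc_fraction` would give a value to compare against the GC content field in the record.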
 
Protein sequence
MQDKIVVRGA RVHNLKNIVV EIPRNKLVVL TGLSGSGKSS LAFDTIYAEG QRRYVESLSA 
YARQFLGQMD KPDVDYIEGL SPAISIDQKT RSHNPRSTVG TVTEIYDYLR LLFARVGRPH
CPHCGHPISS QTVEQMVDQV MALGEGTRLQ ILAPVVRGKK GEHVRVLEEA RRGGFVRMRI
DGEMREIGED ISLDRNKKHT IEIVVDRLVV KPGLEPRLAD SLETALKQTG GLAVCAVIDG
PEIVFSQNFA CVDCGFSFSE ITPRLFSFNS PVGACPTCTG LGSRLEVDVD LVIPDRNKTL
YEGAVAAWAW WTRGYHILEG LARHYGFSLQ VPVRELDPDH LDIILYGTGG TRISFSYRDM
TGRLRRYTAP FEGVVSFLSR KHRETTSDHV REETERMMRT RPCPDCGGRR LKPEALAVKV
GGKSIAEVAA LSITAADRFF AGLNLTARET LIGQRVLKEL RARLGFLVNV GLDYLTLDRQ
AATLSGGEAQ RIRLATQIGS GLVGVLYILD EPSIGLHPRD NGRLLDTLKQ LRDLGNTLIV
VEHDEETIRA ADHIIDIGPG AGLDGGRVVA AGTLREITGT DASITGQYLA GRKFIAVPPA
RRAPGERWVE VIGAREHNLN NIDVPFPLGV FTCVTGVSGS GKSTLVNEIL NKALAAALHG
TRTHPGVHDE IRGTEHLDKV INVDQSPIGR TPRSNPATYT GVFTDIRELF ALTPEARMRG
YKPGRFSFNV RGGRCEACRG DGIIRIEMHF LPDVYVPCEV CGGRRYNRET LDVRYKGRTI
ADVLDMTVDQ AAEFFAPVPK IYRRLCTLQD VGLGYIRLGQ PATTLSGGEA QRVKLAAELS
RRATGRTLYI LDEPTTGLHF ADIKKLLQVL QRLVDAGNTV IVIEHNLDVI KTADYIIDLG
PEGGDRGGRV VATGTPEEVA AQAESYTGRF LRRVLDRTPG RAAAEA
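
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence under translation table 11. As a rough illustration, the sketch below translates the first 30 bases of the gene. It assumes that table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, differing only in which codons may serve as starts; `translate` and `CODON_TABLE` are hypothetical names, and input containing characters other than A, C, G, or T raises a `KeyError`.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGCAGGACA AAATCGTCGT TCGCGGCGCG"))  # MQDKIVVRGA
```

The result matches the first block of the protein sequence listing.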