Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_3956 |
Symbol | |
ID | 8430971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 4137514 |
End bp | 4140405 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645036174 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003193272 |
Protein GI | 258517050 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000293672 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00020375 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGGACA GAATCGTGGT TAAAGGAGCA AGGGTGCATA ATCTGAAAAA TATTGATGTG GAAATACCCA GGGATAAGCT GGTAGTAATT ACGGGTTTGT CCGGTTCCGG CAAGTCCTCT CTGGCTTTTG ACACTATTTA TGCCGAGGGT CAGCGCCGCT ACGTGGAGTC GCTCTCGGCT TATGCGCGCC AGTTTTTAGG ACAGATGAAC AAGCCGGATG TGGATTATAT AGAAGGGCTG TCTCCGGCCA TTTCTATCGA TCAGAAAACA ACTTCCCATA ACCCTCGCTC CACTGTGGGG ACAGTTACCG AAATCTATGA TTACCTGCGC CTGCTTTTTG CCAGGGTGGG GAGACCTCAC TGCCATAAGT GCGGCAAGCC TATTACCAGG CAGACGGTGC AGCAGATAGT TGACCGGTTG ATGCTGCTGC CGGAAAGCAC TAGGCTGCAG ATACTGGCCC CGGTGATCAG GGGTAAAAAA GGTGAGCATG TAAAAGTACT GGAAGACATA AGGCGCGGCG GTTTTGTCCG GGTGAGAGTA GACGGCGAAA CCAGGGAACT GGGTGAAGAA ATCAAGCTGG AGAAAAATAA GAAGCATACC ATTGAAGTGG TAGTGGACAG GGTGATTATC AGGGCCGGTT CGGAGAAACG ACTGGCTGAT TCACTGGAAA CGGCTCTGCA GCAGAGCGGC GGCATTGTGC TGGCCAGTAT CACGGACGGG GAAGAGTTGA TTTTCAGTGA AAACTTTGCC TGTGTGGACT GCGGCATCAG CGTGCAGGAG ATAGCTCCCA GATCCTTTTC CTTTAACAAC CCTTACGGCG CTTGCCCGGA ATGTACCGGT CTCGGCACTA AGCTGGAAAT TGATCCCAAC TTAATTATCC CGGACATGAA TCTTTCTATA GCTGAAGGGG CCATCGAAGG TTGGCATAAA GGAAATATCT CCGCTTCCTA TTTCAGCGGT CTGGCCGAAC ACTATGGCTT TAGCCTGGAT ACACCTGTAA AAGAACTGAA GCCTGATCAC CTGCAGGTAC TGCTCTATGG CACCGGTGAG CAAAAAGTGC GCATTATTTA TACTGATGTG TACGGGCGGC GGCATGATTA CAAGATGCCT TTTGAAGGTA TTATTAATAA CATTGCCAGG CGCTACAGGG AGACAGCCTC CGAGCATATG AGAAATGAAT TTGAACAGTA TATGAGTTCG GTGATTTGTC CGGTCTGCGG CGGGGCCAGG CTGAAGCCTG AGGTGCTGGC GGTAAAAATA GGCGGCTTGT CCATACATGA AGTAACCTGT TTAACGGTTA CCGACACATT GCATTTCTTT GAAAAGTTGG ATTTGACTGA GCGTGAGCGG GTGATAGCCA GGCAGATATT AAAGGAAATT AATGAGCGGT TAGGTTTTTT GATTAATGTG GGTTTAAACT ACCTGACTCT GAACCGGACA GCCGGTACTC TTTCCGGGGG CGAGGCGCAG AGGATCCGCC TGGCTACTCA AATTGGAGCA GGCTTGATGG GGGTTTTGTA TATACTGGAC GAGCCCAGCA TCGGTTTACA CCAGCGGGAT AACGAGAGAT TGTTAAATAC CCTGCGCCGC TTGAGGGATA TAGGCAATAC TTTAATTGTG GTGGAGCATG ATGAGGATAC GGTGCGCACG GCTGATTATA TTATTGATAT CGGGCCGGGA GCCGGTGTGC ACGGCGGGCA GTTGGTGGCT GCCGGGACTT TGCGGGAAAT TCTGGACAAT GAAAATTCTC TAACAGGCCA GTATTTAAGC GGCAGAAAGT ATATTCCGGT ACCGGACAGC CGCCGGGAGC CTAACGGCAA GTATGTGGAA GTTAAAGGGG CGGAAGAAAA TAATCTTAAA AATATTGATG TGCGCTTTCC CCTGGGGGTA TTCACCTGTG TTTCCGGTGT TTCCGGTTCC GGTAAAAGTA CTCTGGTTAA CGAAATTTTA TATAAAACCT TAAGCCAGGA ACTGCACGGG GCCAGGAGCA AGCCGGGTTG CTGCCGGGAA GTGGGGGGCC TGGAATATCT GGACAAGGTG ATAGATGTGA ACCAGTCTCC TATCGGGCGT ACTCCCCGAT CCAACCCGGC CACTTATACC GGGGTGTTTA CCTATATCCG GGAATTATTT GCCCAGACGC CGGAAGCCCG TATGAGGGGC TATAAGCCCG GGCGCTTCAG CTTTAACGTT AAGGGCGGGC GCTGTGAGGC CTGTCAGGGA GACGGCATTA TAAAAATAGA AATGCATTTT TTGCCGGACG TTTATGTTCC GTGCGAGGTT TGCAAAGGAC GCCGCTACAG CAGAGAAACC CTGGAAGTAA CCTATAAAGG CAAGAGCATT GCCGATGTGC TGGATATGAC GGTTGAGCAG GCTGTGGAAT TCTTCCGCCA CATACCGAAG ATTCACCGTA AAATGGAGAC TATGCAGGAT GTCGGTTTGG GTTATATTCG TCTGGGTCAG CCGGCGCCGG AACTTTCCGG CGGTGAAGCG CAGCGGGTAA AGCTGGCTGC CGAGTTGTCC CGCCGCTCCA ACGGCAAAAC CTTTTACATT TTGGACGAGC CGACTACCGG TTTGCACACT GATGATATAG CCAGGTTGTT AAAGGTACTG CACCGCCTGG TGGAAGCGGG GGATACTGTG GTGGTCATTG AGCATAATCT GGATGTGATT AAAACAGCGG ATTATATAAT TGACTTAGGA CCGGAAGGCG GGGACAAGGG CGGCAGCGTG GTAATTGCCG GGACGCCGGA GGAAGTGGCG GCTGAGACAC AGTCTCACAC GGGCAGGTTT TTAAAGAAGG TTTTGCCGGC CGGAGTGGAA GCTGCAGCCG GCGGCAGGGA AATGGGCGAT GCGGAGGAGA ATGCGGCTGC CGGTGAGGCG CAGGCTATAT AG
|
Protein sequence | MLDRIVVKGA RVHNLKNIDV EIPRDKLVVI TGLSGSGKSS LAFDTIYAEG QRRYVESLSA YARQFLGQMN KPDVDYIEGL SPAISIDQKT TSHNPRSTVG TVTEIYDYLR LLFARVGRPH CHKCGKPITR QTVQQIVDRL MLLPESTRLQ ILAPVIRGKK GEHVKVLEDI RRGGFVRVRV DGETRELGEE IKLEKNKKHT IEVVVDRVII RAGSEKRLAD SLETALQQSG GIVLASITDG EELIFSENFA CVDCGISVQE IAPRSFSFNN PYGACPECTG LGTKLEIDPN LIIPDMNLSI AEGAIEGWHK GNISASYFSG LAEHYGFSLD TPVKELKPDH LQVLLYGTGE QKVRIIYTDV YGRRHDYKMP FEGIINNIAR RYRETASEHM RNEFEQYMSS VICPVCGGAR LKPEVLAVKI GGLSIHEVTC LTVTDTLHFF EKLDLTERER VIARQILKEI NERLGFLINV GLNYLTLNRT AGTLSGGEAQ RIRLATQIGA GLMGVLYILD EPSIGLHQRD NERLLNTLRR LRDIGNTLIV VEHDEDTVRT ADYIIDIGPG AGVHGGQLVA AGTLREILDN ENSLTGQYLS GRKYIPVPDS RREPNGKYVE VKGAEENNLK NIDVRFPLGV FTCVSGVSGS GKSTLVNEIL YKTLSQELHG ARSKPGCCRE VGGLEYLDKV IDVNQSPIGR TPRSNPATYT GVFTYIRELF AQTPEARMRG YKPGRFSFNV KGGRCEACQG DGIIKIEMHF LPDVYVPCEV CKGRRYSRET LEVTYKGKSI ADVLDMTVEQ AVEFFRHIPK IHRKMETMQD VGLGYIRLGQ PAPELSGGEA QRVKLAAELS RRSNGKTFYI LDEPTTGLHT DDIARLLKVL HRLVEAGDTV VVIEHNLDVI KTADYIIDLG PEGGDKGGSV VIAGTPEEVA AETQSHTGRF LKKVLPAGVE AAAGGREMGD AEENAAAGEA QAI
|
| |