Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_4228 |
Symbol | |
ID | 8431242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 4398482 |
End bp | 4400266 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645036420 |
Product | phage uncharacterized protein |
Protein accession | YP_003193518 |
Protein GI | 258517296 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.76146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG GCCGCAAGGC TGATATCGTT TCACTGCTGT ATGAGGCCAT GGAGGAAAAT GAAAAACGTA GGCACGGCCA TGAGAAGCAG CAAAACGATG AAGCCAGAAT GCTTTTTGAA GAGTATGTAA AACGAGATTC AAGCCCGGCA AGATTGGAAC TATGGAAATC TTATCAGGCT GAGGCTCCGC TTATTGGTTC GGACGGGCTG AGGAAAAAGC TCGGGGCCAT GGATCTAGAG TATTTTGGCA GGGCTTATCT TCATCATTAT TTCACTCGGG AAACTCCTGA ATTTCACCGG GAGTTAGACC GGATTTGGCA ACAAGGCGTG CTTAAAGGGA TACTGCAGCT GACAGAGAAA ACGGTGGCCA AGATCCGGCG GTTGCCAGGC TGCCGCCGGG CGGTGGCCGC ACCCAGGGGC CACGCCAAGA GTACCAACCT GACCTTTAAG GATACCTTGC ACGCCATAGT CTATGAATAT AAGCCCTATA TACTGATACT GTCAGATTCA TCCGATCAGG CGCAGGGATT CTTGTCAGAT ATCCGGGGGG AATTGGAAGA GAACCTGGCC ATCAGGGAAG ATTTCGGAGA CCTTCAAGGG AAGAAAGCCT GGCGTGAAGA TGTACTGATG ACCTCCACGG ATGTGAAGAT TGAGGCCATC GGCAGCGGTA AGAAAATCCG GGGCCGACGC CATAAAAACT GGCGGCCTGG GCTGATTGTA TTGGATGATA TTGAAAACGA TGATAATGTC CGGACGCCGG AACAAAGAAA GAAGCTGGAA AATTGGTTCT TTAAAGCAGT GAGTAAGGCT GGTGACGACT ACACTGATAT TGTGTACATC GGCACCATTT TGCATTACGA CTCCCTTCTT TCCAAGGTGC TTAAGAATCC GGCCTATAAG TCAGTAAAAT ACCGGGCGAT CATCTCCTGG TCCGAACGCA AAGACCTGTG GGAAAAATGG GAAGACATTT ATATTGATCT GGACAATGAA AATCGGGAGC AAGATGCCAG GGCATTTTTT GAGGCCACTA AAGATGAAAT GCTAAAAGGT ACCCGGGTTT TATGGGAAGA TAAGCTTTCC TATTATGCTC TTATGGTGAT GCGGGTTTCT GAGGGTGAAG CCAGCTTTAA CTCTGAGGAA CAAAACGAGC CTATTAATCC AGAAGACTGC CTGTTCAACG AAGAGTGGTT CGAATATTAT AACGAGGCTG CCATTGATTT CAGGGAAAAA CGTTTCCGTT TCTTTGGCTT TGTTGACCCC TCTTTGGGGG GCAAGGGCAA GAAGAAGAAA AGCGACTTTT CCACAATCAT TACTTTGGTC AAGGATGGCC AGACCGGTTA TATGTATGTG CTTGATGCCG ATATCGAAAG ACGCCACCCG GACAGGATCA TCGAAGACAT TATGGAAAAG GAACGCTGGC TGAAGCTGAC ATTTGGCCGG GGATATTTCC AATTCGGCTG TGAGACAAAC CAGTTTCAAT GGTTTTTAAA AGAAGAATTG GCCAGGCGCA GCGCTGAAGC CGGTATTTAC CTCCCCATCG AGGAGGTAAA TCAAACCAGC GATAAATATG GACGAATCCA GACTTTGCAG CCTGATATAA AAAACAGGTA CATTAAATTT AACATCCGGC ATAAGCGTCT TTTGGAGCAA CTCAGGCAAT TTCCCATGGC GGCCCATGAT GATGGGCCGG ATGCCCTGGA AGCATGCCGA ACTCTGGCCA GATCTAAACA ACAGGTTGAC CAGGGCTTGC TGAATGTATT TAAAAAACTT CGGATATATG GGTGA
|
Protein sequence | MKKGRKADIV SLLYEAMEEN EKRRHGHEKQ QNDEARMLFE EYVKRDSSPA RLELWKSYQA EAPLIGSDGL RKKLGAMDLE YFGRAYLHHY FTRETPEFHR ELDRIWQQGV LKGILQLTEK TVAKIRRLPG CRRAVAAPRG HAKSTNLTFK DTLHAIVYEY KPYILILSDS SDQAQGFLSD IRGELEENLA IREDFGDLQG KKAWREDVLM TSTDVKIEAI GSGKKIRGRR HKNWRPGLIV LDDIENDDNV RTPEQRKKLE NWFFKAVSKA GDDYTDIVYI GTILHYDSLL SKVLKNPAYK SVKYRAIISW SERKDLWEKW EDIYIDLDNE NREQDARAFF EATKDEMLKG TRVLWEDKLS YYALMVMRVS EGEASFNSEE QNEPINPEDC LFNEEWFEYY NEAAIDFREK RFRFFGFVDP SLGGKGKKKK SDFSTIITLV KDGQTGYMYV LDADIERRHP DRIIEDIMEK ERWLKLTFGR GYFQFGCETN QFQWFLKEEL ARRSAEAGIY LPIEEVNQTS DKYGRIQTLQ PDIKNRYIKF NIRHKRLLEQ LRQFPMAAHD DGPDALEACR TLARSKQQVD QGLLNVFKKL RIYG
|
| |