Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_1849 |
Symbol | |
ID | 8428828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1963169 |
End bp | 1964569 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645034185 |
Product | phage minor structural protein |
Protein accession | YP_003191319 |
Protein GI | 258515097 |
COG category | [S] Function unknown |
COG ID | [COG4926] Phage-related protein |
TIGRFAM ID | [TIGR01665] phage minor structural protein, N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0322579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.615795 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTATATT TATTTGATTC AGCAGAAAAA CTGTTAGCCA TCTACTCCCA GGAAAATGCC ACTTGCCCAT ATTACGATGC AGTACATACA GAAAAACTTA CCGGGGAGAA TACCTTTATT TTTACTATTC CGGCAGACCA CCAGGATAGC CGTTATATTA CTGAGGGGAA TCTTGTGGGA TTTAAAGACC CATATAAAGA CTGGCAGCTT TTCGAGATAA AACGTATTAC AGATATTCAT GGCGAGGGTT TAACCCGAAC TGCATACTGC GAGCATGTAC TTTATGAACT TATAGATGAT TTTATCGAAG ATATACGGCC TACCGATTGT ACCGCATTAA TTGCGTTAAT TAAGGCTTTA GAGGGAACCC GCTGGGAGCC TGGAGTTGTT GATGACCTGG GGGTAAACAG TACCAACTTT TACTATGAAT CTGCACTTTC CGCTGTTCAA AAGGTTGCCG CCATATGGAA AGGTGAATTG AGATTTCGAG TAGTTATCTC CAACAATGCT ATTACCAAGC GGTATGTCGA TTTACTGGCC AGGCGTGGGG CAGTAACAGG CAAGCAGTTT ACCTACGACC GCAACACCTC GCAGATTGAG CGAGAGGTTG ATTTAACCAG CGTTGTAACT GCTCTCTATG GCAGGGGAAA AGGCGTGGAG GTGGGTGACG GGTACGGCAG ACGTTTAGAT TTCAGCGGAA TAGGATGGGC CGTAGCCAAC GGAAATCCGG CTGATAAACC CTTGAGCCAA CGATGGATTG GAGATGCTCA GGCTCTCGCT CAGTGGGGCA GGGCAGGCCG TCATCGCTTT GGCGTGTTTG AGGACTCTGA GGAAACTGAC CCGGCGGTAC TATTACAAAA AACCTGGGAT ACCCTACAGG AACGAAAAAT GCCAAGAGTA ACCTATAGCC TGGATGTTGT TGACCTGGAG AGTTTAAGCG GATATGGTCA CGAAAAGGTA AGATTAGGTG ATACGGTCAG GGTTATCGAT AGGAAGTTTA ACCTAGAGAT TTTGGTGGAA GCGAGAATAC TGGAGATTAA CCGAAACCTT TTAAAACCGG AGGATACCGA GATTACCCTG GGTAACTTTA CCCCAAGTAT AACCGATGAA GCCTTGAAGC AAATGGAAAT TAATCGAGCC GTTAATGATA AGCAGGGCGT ATGGGATAGG GCCAGCCAGT TTAATGCGGA CGGTACATTA AGTGCCGGTA AGCTGACGGA TACACTGGTA GGCTTAGACC ATACTTTGCA ACTGGCGAGC GAAGCTGTGA CCGAGGCTAA AATAGCTGTA GGGGCCATTT CAACTCCTAA ACTCGCTACT AACGCTGTTA CCGCAGATAA ACTCGCACCC GGTACTATAA ATGAGGCAAA GATGAACTGG AAAACACATC TTTTGTATTA A
|
Protein sequence | MLYLFDSAEK LLAIYSQENA TCPYYDAVHT EKLTGENTFI FTIPADHQDS RYITEGNLVG FKDPYKDWQL FEIKRITDIH GEGLTRTAYC EHVLYELIDD FIEDIRPTDC TALIALIKAL EGTRWEPGVV DDLGVNSTNF YYESALSAVQ KVAAIWKGEL RFRVVISNNA ITKRYVDLLA RRGAVTGKQF TYDRNTSQIE REVDLTSVVT ALYGRGKGVE VGDGYGRRLD FSGIGWAVAN GNPADKPLSQ RWIGDAQALA QWGRAGRHRF GVFEDSEETD PAVLLQKTWD TLQERKMPRV TYSLDVVDLE SLSGYGHEKV RLGDTVRVID RKFNLEILVE ARILEINRNL LKPEDTEITL GNFTPSITDE ALKQMEINRA VNDKQGVWDR ASQFNADGTL SAGKLTDTLV GLDHTLQLAS EAVTEAKIAV GAISTPKLAT NAVTADKLAP GTINEAKMNW KTHLLY
|
| |