Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_3666 |
Symbol | |
ID | 8430674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 3852268 |
End bp | 3853281 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 645035893 |
Product | major capsid protein HK97 |
Protein accession | YP_003192998 |
Protein GI | 258516776 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.017941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAAAC CAATCTTAGA ATATATGCCT GATGGCAGAT TAGACGAAAG GCTTATCCGT GGTGCTGCAT CTGGTATGAG TGAATCAGTT CCCAGTGACG GTGGCTTTCT CATTCAGCAA GATTTCACTT CTGAGCTATT AAAAAGGACA TATGAAACCG GTATCATTGC AAGCCGGTGC CGTAAACTTC CAATTAGCAC AAACGCTAAC GGCATGAAAA TAAATGCAAT TGATGAAAAC AGTCGGGCCA CTGGTTCCCG CTTGGGTGGT ATTAGGGCTT ACTGGGCCGC TGAAGCTGAA ACTGTGGCAG CATCTAAGCC TAAATTCAGG CAGATGGAGC TTAACTTACA AAAGCTTATT GGTTTGTGTT ATGCAACGGA TGAGCTTTTA GCAGATCAAG CCACTCTTGA ATCAGTATTA ATGGATGGTT TTGCTGAAGA ATTTGGATTC CTTGTTGACG ATGCTGTAAT CCGGGGTACT GGTGTAGGAA TGCCTCTGGG GATCTTAAAT TCAAATGCAG TTGTCACTGT TCCAAAGGAA AATGCTCAGG CGGCCAGGAG TCTTACAGCA GAGAACATTA TAAATATGTG GGCAAGGTTA TGGGCTCGTT CTCAACCTAA TGCAGTTTGG TTAATTAACC AAGATATAAT CCCGGAATTA TATCAACTTA AGATCCCTAT TGGTACTGCT GGACAACTTC TTTATATGCC GGCAAATGGT TTAAGTGAAA TGCCTTATGG CACCTTATTT GGTCGGCCGG TTATCCCTGT TGAGTATTGC GAAACATTAG GTACAAAAGG GGATATTATA CTGGCGGATT TTGGGCAGTA CGTTATTGCT GATAAAGGTG GGGTTACCTC CGCTGTTAGT ATTCATGTGC GCTTCATTTA CGATGAGCAG TGCTTTAGAT TCACATACCG TGTTTCAGGT CAAAGTTTCT GGAATGCACC TCTAAGCCCA TACCGTGGGA CTAATACCAT AAGTCCGTTT GTAGTATTAG AAACCCGCGT TTAA
|
Protein sequence | MLKPILEYMP DGRLDERLIR GAASGMSESV PSDGGFLIQQ DFTSELLKRT YETGIIASRC RKLPISTNAN GMKINAIDEN SRATGSRLGG IRAYWAAEAE TVAASKPKFR QMELNLQKLI GLCYATDELL ADQATLESVL MDGFAEEFGF LVDDAVIRGT GVGMPLGILN SNAVVTVPKE NAQAARSLTA ENIINMWARL WARSQPNAVW LINQDIIPEL YQLKIPIGTA GQLLYMPANG LSEMPYGTLF GRPVIPVEYC ETLGTKGDII LADFGQYVIA DKGGVTSAVS IHVRFIYDEQ CFRFTYRVSG QSFWNAPLSP YRGTNTISPF VVLETRV
|
| |