Gene Dtox_3666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3666 
Symbol 
ID8430674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3852268 
End bp3853281 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content43% 
IMG OID645035893 
Productmajor capsid protein HK97 
Protein accessionYP_003192998 
Protein GI258516776 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.017941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAAAC CAATCTTAGA ATATATGCCT GATGGCAGAT TAGACGAAAG GCTTATCCGT 
GGTGCTGCAT CTGGTATGAG TGAATCAGTT CCCAGTGACG GTGGCTTTCT CATTCAGCAA
GATTTCACTT CTGAGCTATT AAAAAGGACA TATGAAACCG GTATCATTGC AAGCCGGTGC
CGTAAACTTC CAATTAGCAC AAACGCTAAC GGCATGAAAA TAAATGCAAT TGATGAAAAC
AGTCGGGCCA CTGGTTCCCG CTTGGGTGGT ATTAGGGCTT ACTGGGCCGC TGAAGCTGAA
ACTGTGGCAG CATCTAAGCC TAAATTCAGG CAGATGGAGC TTAACTTACA AAAGCTTATT
GGTTTGTGTT ATGCAACGGA TGAGCTTTTA GCAGATCAAG CCACTCTTGA ATCAGTATTA
ATGGATGGTT TTGCTGAAGA ATTTGGATTC CTTGTTGACG ATGCTGTAAT CCGGGGTACT
GGTGTAGGAA TGCCTCTGGG GATCTTAAAT TCAAATGCAG TTGTCACTGT TCCAAAGGAA
AATGCTCAGG CGGCCAGGAG TCTTACAGCA GAGAACATTA TAAATATGTG GGCAAGGTTA
TGGGCTCGTT CTCAACCTAA TGCAGTTTGG TTAATTAACC AAGATATAAT CCCGGAATTA
TATCAACTTA AGATCCCTAT TGGTACTGCT GGACAACTTC TTTATATGCC GGCAAATGGT
TTAAGTGAAA TGCCTTATGG CACCTTATTT GGTCGGCCGG TTATCCCTGT TGAGTATTGC
GAAACATTAG GTACAAAAGG GGATATTATA CTGGCGGATT TTGGGCAGTA CGTTATTGCT
GATAAAGGTG GGGTTACCTC CGCTGTTAGT ATTCATGTGC GCTTCATTTA CGATGAGCAG
TGCTTTAGAT TCACATACCG TGTTTCAGGT CAAAGTTTCT GGAATGCACC TCTAAGCCCA
TACCGTGGGA CTAATACCAT AAGTCCGTTT GTAGTATTAG AAACCCGCGT TTAA
 
Protein sequence
MLKPILEYMP DGRLDERLIR GAASGMSESV PSDGGFLIQQ DFTSELLKRT YETGIIASRC 
RKLPISTNAN GMKINAIDEN SRATGSRLGG IRAYWAAEAE TVAASKPKFR QMELNLQKLI
GLCYATDELL ADQATLESVL MDGFAEEFGF LVDDAVIRGT GVGMPLGILN SNAVVTVPKE
NAQAARSLTA ENIINMWARL WARSQPNAVW LINQDIIPEL YQLKIPIGTA GQLLYMPANG
LSEMPYGTLF GRPVIPVEYC ETLGTKGDII LADFGQYVIA DKGGVTSAVS IHVRFIYDEQ
CFRFTYRVSG QSFWNAPLSP YRGTNTISPF VVLETRV