Gene Dtox_4105 details

Gene Information

Locus tag: Dtox_4105
Symbol:
ID: 8431119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4275646
End bp: 4276857
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 46%
IMG OID: 645036300
Product: protein of unknown function DUF214
Protein accession: YP_003193398
Protein GI: 258517176
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4591] ABC-type transport system, involved in lipoprotein release, permease component
TIGRFAM ID:
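
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4275646
END_BP = 4276857
GENE_LENGTH_BP = 1212
PROTEIN_LENGTH_AA = 403


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1212
    print(expected_protein_length(GENE_LENGTH_BP))  # 403
```

Under these assumptions, 4276857 - 4275646 + 1 gives the listed 1212 bp, and 1212 / 3 - 1 gives the listed 403 aa.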


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0276302
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTAT TTGTTAACTT TAGGCAGGCT GTGAGGGGAA TTTTTAAAAA GAAGGTGCGT 
GCTTTTTTGG CTATGTTAGG TATCAGCACC TCCATAGCTT CTATAATTCT TTTACTTTCA
CTAGGGCAGG GTGTGCAGGA AAAGGTATCT TCACAGTTTG AGGCTCTGGG GGCTAACCAA
CTTATAATCA TTCCCATGCG CACTCTCAAA ACCACCTCTA ACCAACCGGA CTTTTTGAGA
GGTCTGCCTA CATTTACCGG CACTCTGAGT CTGGAAGATA TTGATGCTAT TCAAAAAGTA
CCTGACGTGG CTGTGGCAGC GCCTCAATCG GAAAATGTCC TCACTGTAGT AAGGCCGGGT
GGTATTAACA GCGCAGATAC CTTAATCACC GGCGTTACAC CGGAGTACTT TACACTGGCA
AAGCTAAATC TTTCAGCAGG TCAATTTTTC ACTTCCGCTG ATGAAACATC ACAGAATGTT
GTTCTTGGCA GCGCCGTTAA AGAAACCCTA TTTGGTGAAG AAAACGCCCT GGGCCAACAA
ATTATCATTA GAAATAAAAG ATTTAATGTT GCCGGTGTGC TGGAACCTAA AGAAAGCATA
GGTTTTTCTT TTAATGAAAG AGTATATCTG CCTATTGAAA AAGCTCAGTC TATTACCGGG
CTGGATAAGC TGTCCCTGAT TCTGGTACAG GCAAAAAGCA ATGATGTGGT AGACAGGTGC
AAGGATAAAA TCGGCTCAGC ACTTCGCCCG CTGCATAAAA CCACTGACTA CAGTATTCTA
AAACAGGGTG AAATGCTCTC TATGATTGAC AAGTTTTCTG CCGTACTGAC TGCTATGCTA
ACCGGAATTA CAGGAATAAC ACTGGTCATT TCGGCAATCG GGATTACTAA CGTAATGCTG
TTGACAGCTA TTGAGAGGAC AAGGGAAATA GGGATCAGGA AGGTACTGGG GGCAACCACT
TTTGATATCT TCATACAATT TGTCTTTGAG GCCGTTCTTA TAGCCGCCCT GGCAGGTTTA
ATCGGTATCG CGGCGGGTTA CGGGTTTATC AAGATATTAT CGCACTACCT GCCGTCCCTA
CCCTTTAAAA TCACCTGGTT ATCGATCGCC AGGACAGGGC TGGCGGCAAC TACTGCCGGC
ATTATCTTCG GCCTCTACCC GGCTGTCCGG GCCGCCCTGC TGCAGCCGGC CAGAGCTATC
AGACATAAAT AA
 
Protein sequence
MKLFVNFRQA VRGIFKKKVR AFLAMLGIST SIASIILLLS LGQGVQEKVS SQFEALGANQ 
LIIIPMRTLK TTSNQPDFLR GLPTFTGTLS LEDIDAIQKV PDVAVAAPQS ENVLTVVRPG
GINSADTLIT GVTPEYFTLA KLNLSAGQFF TSADETSQNV VLGSAVKETL FGEENALGQQ
IIIRNKRFNV AGVLEPKESI GFSFNERVYL PIEKAQSITG LDKLSLILVQ AKSNDVVDRC
KDKIGSALRP LHKTTDYSIL KQGEMLSMID KFSAVLTAML TGITGITLVI SAIGITNVML
LTAIERTREI GIRKVLGATT FDIFIQFVFE AVLIAALAGL IGIAAGYGFI KILSHYLPSL
PFKITWLSIA RTGLAATTAG IIFGLYPAVR AALLQPARAI RHK
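
The relationship between the gene sequence and the protein sequence can be reproduced by translating codon by codon. The sketch below is an illustration only, not the pipeline that produced this record. It assumes the amino acid assignments of translation table 11, which match the standard genetic code for non-initiator codons, and it ignores table 11's alternative start codons because this gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_line = ("ATGAAACTAT TTGTTAACTT TAGGCAGGCT "
                  "GTGAGGGGAA TTTTTAAAAA GAAGGTGCGT")
    print(translate(first_line))      # MKLFVNFRQAVRGIFKKKVR
    print(translate("AGACATAAAT AA"))  # RHK
```

The two outputs match the first 20 and the last 3 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.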