Gene Dtox_0987 details


Gene Information

Locus tag: Dtox_0987
Symbol:
ID: 8427926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 1010072
End bp: 1011307
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 47%
IMG OID: 645033325
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003190499
Protein GI: 258514277
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
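
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon; neither is stated in the record, although both are consistent with the listed lengths. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1010072
end_bp = 1011307

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# One codon is 3 bp. Assumption: the last codon is a stop codon
# and therefore contributes no amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

A remainder of zero indicates that the gene length is a whole number of codons.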


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.965573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00212312
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCTAAGTC AACCGGATCA ATTAAAAGAT ATACAGAGTC AATTTCAAAA ATGCGTGCGC 
TGCGGGCTCT GCCGATCGGT CTGTCCTATT TTTAAAGAAG ACCGCAGGGA AACCGCAGCT
CCCAGAGGCA AAGTATTTTT AGCACAAATG CTGGCAGAAG GAGAAATTAC ACCTGACAGC
AAAGCCGCCC AAAACCTTTC CATGTGCCTG ATGTGTGAAT CCTGTTCCAG CGAGTGCCCT
TCGGGAATCG AAGTGCACAA AATAGTCAGC CTGGCTCGAT CCATGGTTAA TGAAAACAAT
CCTTCATTAA CAAACAAGGC AAATAAATTG ATCTTCAAAG ATTTATGGAG CAAACCTTCT
TTAATGAATT TAAGTTTCAA TCTAATTAAA ACCGGTCAAG CACTCGGACT GCTGGATTTC
GGCACCAAAT CAGGACTGCT GCCAAAATCA GGACGCCTGC TGGGTGAACT GCCAGGGAAA
CCGGCACGAC AGGCACTGCC GGAAATAGTA CCCCCCACAA CCAGACAAAA GGCGCGCACC
GGGTATTTTT TGGGCTGTGC CACCAACTAC CTCTATCCTC AAGTGGCTTT CAGTACAGTA
AAAATACTGT CACACCTTGG CTGTGAAGTA GTAATACCTC GTGAACTAAC CTGCTGCGGC
TTGCCTCAGC TGGCTAACGG CGAACCTGCT GCCGGACACA ATTTAGCCAG GCAAAACTTG
CAAATCTTTA AACGGGCCGG GGTTGAAGCA GTAGTCTGTG ACTGTGCTTC TTGCAGTGCT
ACCTTAGCGG AAAACTGGGG ACAAGCTCTA CCGGTATATG ACGCCGTAAA ATATATTATA
CAGGAATTAA AGCTGGATTT GTCAGATAAA AAACAAATTA ATAATCAACC AATTAAAATA
GTAACCTACC ATGATCCCTG CCACCTAGCC AAAGCACAAA GAATCAGGCA GCAGCCACGA
CAATTACTGC AGATGCTGCC GGGCGTGGAA TACAGGGAAA TGCCCGGGGC CGATAACTGC
TGCGGCGGTG CCGGCACTTT CGTCGTGAAA AACTATGATC TGAGCATGCG TATTCTGGAT
CGAAAAATCG CATCCATCAA AGAAACCGGT GCTGACATTG TAGCCACCTG CTGTCCTACC
TGTACTATGC AGCTTAAACA CGGTTTGGAT AAGCACGGAC TTCAAATTGA AGTAAAACAC
CCACTGGAAC TCCTGGCCGA GACACTCGGG CTATAG
 
Protein sequence
MLSQPDQLKD IQSQFQKCVR CGLCRSVCPI FKEDRRETAA PRGKVFLAQM LAEGEITPDS 
KAAQNLSMCL MCESCSSECP SGIEVHKIVS LARSMVNENN PSLTNKANKL IFKDLWSKPS
LMNLSFNLIK TGQALGLLDF GTKSGLLPKS GRLLGELPGK PARQALPEIV PPTTRQKART
GYFLGCATNY LYPQVAFSTV KILSHLGCEV VIPRELTCCG LPQLANGEPA AGHNLARQNL
QIFKRAGVEA VVCDCASCSA TLAENWGQAL PVYDAVKYII QELKLDLSDK KQINNQPIKI
VTYHDPCHLA KAQRIRQQPR QLLQMLPGVE YREMPGADNC CGGAGTFVVK NYDLSMRILD
RKIASIKETG ADIVATCCPT CTMQLKHGLD KHGLQIEVKH PLELLAETLG L
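
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 gives for an alternative start codon. The sketch below is a minimal, hand-written translator under that assumption, not the tool that produced this record. The function name `translate_cds` and the constants are hypothetical; the codon assignments and the start-codon set follow NCBI genetic code 11 as far as I know.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: initiator codons listed for table 11; any of them is
# read as methionine when it is the first codon of the CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the bases in tens.
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCTAAGTC AACCGGATCA ATTAAAAGAT"))  # MLSQPDQLKD
```

Passing the full gene sequence to the same function should yield the protein sequence listed above without the grouping spaces.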