Gene Dtox_2678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2678 
Symbol 
ID8429664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2839138 
End bp2840094 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content45% 
IMG OID645034956 
Productcysteine synthase 
Protein accessionYP_003192083 
Protein GI258515861 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01138] cysteine synthase B 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTGCG ATAATATACT CAAGACCATA TGCAATACGC CGATGATCAG AATTAATCGG 
CTGAATCCAA ACCCAAATGT GGAGATCTAT GCAAAATTTG AAGGCACAAA TCCCGGAGGA
AGCATAAAAG ATCGGATTGC GCTTAAGATG ATCGAGCAGG CCGAGGCCGA AGGTGTACTT
AACAGGAAAA AAACCATCAT AGAGGCAACT TCTGGCAATA CAGGCATTGC GCTGGCGATG
ATCGGAGCGG TCAAGGACTA CAAGGTGGAA ATTGTTATGA GTGAAGCCGT ATCGATCGAA
AGGCGCAAGA TGATACAGGC ATTCGGCGCG AAGGTCATCC TGACCGATCC GGAATTTGGA
ACGGACGGTG CTATTCTTAA AGTACGCAAG TTGCTGGAGC AATATCCGGA TCGCTATTTC
TGCACAGATC AGTTCACGAA TAAGTACAAT AAACTCGCCC ATAGTGAAAT TACTGCCGAA
GAGATCTGGT TCCAAACGAA TGGCAGAGTT GATTATTTCG TTTCAGGGTT GGGAACATCG
GGAACCTTGA TGGGGGTTGG TGCCGGCCTG AAAAAGTACA ATCCTAAAAT AAAAATCATC
AGTGCGGAAC CGGTTGCCGG GCATTATATT CAGGGTCTGA AAAATCTTCA GGAAGCGATT
GTTCCGGGCA TTTATAATGA AGCTGAACTG GATGAAATTA TTATGATCGA AACTGAGGAA
GCCTTCGAGA TGGCCCGTCA GATTGTCCGC AAGGAAGGGA TCTTTGTCGG CATGAGCAGT
GGTGCGTCCA TGTTAGGAGC GGTTAAAATT GCCCGCAAGC TTTCTTCAGG TGTAATTGTT
ACTATTTTTC CTGATCGTGG AGAAAAATAT TTAAGCACGG ATCTCTTCAA GTCTGAGGCA
GGTGATGGAA AATTAGGACG GATGGAGGCA GACAGATTTT GCCGCAAATA CAAATAA
 
Protein sequence
MICDNILKTI CNTPMIRINR LNPNPNVEIY AKFEGTNPGG SIKDRIALKM IEQAEAEGVL 
NRKKTIIEAT SGNTGIALAM IGAVKDYKVE IVMSEAVSIE RRKMIQAFGA KVILTDPEFG
TDGAILKVRK LLEQYPDRYF CTDQFTNKYN KLAHSEITAE EIWFQTNGRV DYFVSGLGTS
GTLMGVGAGL KKYNPKIKII SAEPVAGHYI QGLKNLQEAI VPGIYNEAEL DEIIMIETEE
AFEMARQIVR KEGIFVGMSS GASMLGAVKI ARKLSSGVIV TIFPDRGEKY LSTDLFKSEA
GDGKLGRMEA DRFCRKYK