Gene Dtox_2274 details


Gene Information

Locus tag: Dtox_2274
Symbol:
ID: 8429257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2442685
End bp: 2443893
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 45%
IMG OID: 645034581
Product: cysteine desulfurase NifS
Protein accession: YP_003191711
Protein GI: 258515489
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR03402] cysteine desulfurase NifS
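
The length fields in this record follow from the coordinates. The sketch below shows that arithmetic, assuming the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the gene length includes the stop codon. The function name cds_lengths is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2442685, 2443893)
print(gene_len, protein_len)  # 1209 402
```

Both values agree with the Gene Length and Protein Length fields of the record.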


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.363972
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000604426
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGCAGAG TCTATTTTGA CCACAGTGCA ACAACTCCAA TTGATACTGC CGTAGTAGAT 
GCTATGACAC CTTACCTGAC AGAATACTTT GGCAACCCGT CCAGTTTTCA CTCATACGGA
CGGGAAGTGC GTAAAGCTGT TGAAGCAGCC AGAGAAAAAG TGGCTCTGGC TATCGGTGCG
GATCCGAAAG AAATAACTTT CACCAGTGGC GGTACTGAAT CAGATAATAT GGCTATCCAC
GGGGTAGCTT ATATGAACAA GAATAAAGGC AATCATATTA TTACTTCTGC CGTTGAGCAT
CATGCTGTAT TAAATACTGT TAAGGCGCTG GGCAAAGAGG GTTTTGATAT AACCATTTTA
CCGGTAGATA AATATGGTAT GGTTAACCCT GATGATGTGG CTGCAGCTAT AACAGATAAA
ACCATATTAA TCACAATAAT GCATGCAAAT AACGAGATCG GCACTATTCA ACCGGTAAAG
GAAATAGCTG CGATAGCAAA AGCAAGAGGC GTGTATGTGC ATACAGACGC AGTGCAAAGT
TTTGGTAAAA TACCGTTGAA TGTAGACGAT TTAGGAGTAA ACCTGCTAAC TCTGTCCGCT
CATAAATTTT ACGGGCCAAA GGGTGTTGGT GTTCTATATA TACGCAAAGG AACCAGGTGG
AAGCAAACAC TCATGCACGG TGGGTCTCAA GAGCGCTTGC GCCGTACCGG AACGGAGAAT
GTTCCCGGTA TTATCGGTCT GGCCAAGGCG GCAGAAATTG TATCAGCCAA TCTGCAAAAA
GAGCAGGATT ACCTCACCAA ATTGAGGGAC AAATTAATCA AAGGCGTGAT GGACAACATT
GATAAGGTAA TATTAACCGG ACACCCAACA CAGCGTCTTT GCAACCTTGC CAGTTTTTGT
TTCGAATATA TTGAGGGTGA ATCACAGCTT CTCAGCCTGG ATATGAAAGG TATAGCTGCT
TCCAGCGGTT CAGCATGCAC ATCTGGCTCC CTGGAACCTT CACACGTATT GATGGCTTTG
GGATTAACCC ATGAGATAGC TCACGGATCG TTAAGACTTT CCTTGGGCAA AGATAATACA
GAAGAAGATG TAGATTATTT CTTAGAGGTT TTACCGGCGG TAGTTCAGAG GCTGCGGATG
ATGTCACCTC TGGCTGAAGA TGTTCCGGAG ATGGAAGAGT TTATGGAGGG GGTACGCTGC
GGTGTATAG
 
Protein sequence
MRRVYFDHSA TTPIDTAVVD AMTPYLTEYF GNPSSFHSYG REVRKAVEAA REKVALAIGA 
DPKEITFTSG GTESDNMAIH GVAYMNKNKG NHIITSAVEH HAVLNTVKAL GKEGFDITIL
PVDKYGMVNP DDVAAAITDK TILITIMHAN NEIGTIQPVK EIAAIAKARG VYVHTDAVQS
FGKIPLNVDD LGVNLLTLSA HKFYGPKGVG VLYIRKGTRW KQTLMHGGSQ ERLRRTGTEN
VPGIIGLAKA AEIVSANLQK EQDYLTKLRD KLIKGVMDNI DKVILTGHPT QRLCNLASFC
FEYIEGESQL LSLDMKGIAA SSGSACTSGS LEPSHVLMAL GLTHEIAHGS LRLSLGKDNT
EEDVDYFLEV LPAVVQRLRM MSPLAEDVPE MEEFMEGVRC GV
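
A quick way to sanity-check a pasted gene sequence against the record is sketched below. It is only an illustration: the helper names (clean, gc_percent, looks_like_complete_cds) are hypothetical, and the start and stop codon sets are those of NCBI translation table 11, the table named in the record. The sequence is expected in the layout used above, with bases grouped in blocks of 10.

```python
def clean(seq):
    """Remove the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# Start and stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq):
    """True if the length is a multiple of 3 and the first and last codons
    are a table 11 start codon and stop codon, respectively."""
    s = clean(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in TABLE11_STARTS
        and s[-3:] in TABLE11_STOPS
    )
```

The gene sequence in this record begins with GTG and ends with TAG, so it should pass looks_like_complete_cds. Rounding the result of gc_percent for the full sequence gives a value to compare with the GC content field.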