Gene Dtox_0572 details

Gene Information

Locus tag: Dtox_0572
Symbol:
ID: 8427507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 592107
End bp: 593477
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 48%
IMG OID: 645032937
Product: argininosuccinate lyase
Protein accession: YP_003190115
Protein GI: 258513893
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
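
The coordinates and lengths listed above agree with each other if the start and end positions are read as 1-based and inclusive: 593477 - 592107 + 1 = 1371 bp, and 1371 / 3 = 457 codons, one of which is the stop codon, leaving 456 aa. The following is a minimal Python sketch of that check. The function names are illustrative and do not come from the database, and the inclusive-coordinate convention is an assumption that the numbers happen to support.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for a complete CDS: codons minus the one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(592107, 593477)
print(length_bp)                  # 1371
print(protein_length(length_bp))  # 456
```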


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGC TTTGGGGTGG TCGTTTTGAA AAAGAATCGG ATCACTTAAT GGAGGATTTT 
CATTCCTCCA TTTCTTTTGA TCAGAGATTG TATAAACAGG ATATTGCCGG CAGTATGGCC
CACGCCAGGA TGCTGGGCAA AGCAGGCATT ATCTCTAAAG CAGAAGCGGA GCGGATAGTA
GCAGGTTTAC AGGAAATACT GGCGGATATC GAAGCGGGTA AAATAGAGTT TTCCGTTGCC
GCGGAAGATA TTCATATGAA TATCGAAGAA CTGTTAACTC AAAGAACCGG AGAAGTTGGC
AAAAAACTGC ATACCGCTCG CAGCCGCAAT GACCAGGTGG CGCTGGACGT ACGCATGTAT
TTAAAAGAAG AGATAACAGA GGTCATGAAT CTTATAAAAT ATCTGCAGGA TACACTGGCG
GAGCTGGCCG AAGAACATCT GGATACGGTA CTGCCCGGCT ATACTCATTT ACAGAGGGCA
CAACCGGTAA CCCTGGCACA TCACCTGATG GCTTATTATC AGATGTTTAG CCGTGATCTG
GACAGGCTTG GCGACTGTTA CCGCCGCACT GATGTAATGC CTTTGGGTTC CGGGGCTCTG
GCCGGCACCA CGTTTGCCCT GGACCGGCAG TATGTGGCCG AACAGCTTGG GTTTGCCCGT
ATCAGTGAAA ACAGCCTGGA TGCCGTGGCG GATCGTGATT TTGCCGTAGA GTTTGCTTCG
GCCGCCTCTT TGATTATGAT GCACCTGAGC CGGTTTTGTG AAGAAATAAT TCTCTGGTCT
ACAGCGGAAT TTGCGTTCAT TGAATTGGAT GATGCTTACA GTACCGGCAG CAGCATGATG
CCTCAAAAGA AAAACCCGGA TGTGGCGGAA TTAATTCGTG GCAAAACAGG TAGAGTTTAC
GGCGATTTGC AGGCTCTTTT GACTATGCTG AAGGGCTTGC CGCTGGCCTA TAACAAAGAT
ATGCAGGAGG ATAAGGAAGC GCTGTTTGAT GCCGTTGATA CAGTAAAGAA ATGCTTAATG
TTGTTCCGGC CCATGCTGGC CACTGTAAAG GTGAAGAAAG AAAATATGGC AAGAGCCGCC
CGTGGTGGCT TTACCAACGC CACTGATTTA GCCGACTATT TGGTCTATAA GGGGGTACCT
TTCCGCCAGG CTCATGAAAT AGCCGGAAGA CTTGTTTTGT ACTGTTTGGC CAAGAAAAAG
ACACTGGAGG AAGTTAGCCT CGGGGAATAC AGGGAATTTT CCGATTTGAT AGCCGAGGAT
ATTTACCAGG CTATTGATAT AAATCATTGT GTGGAAGCCA GAAAGGTTTA TGGCGGGCCG
GCCAGGGCTG TTGTGCAGGA GGCTATAAAC AGAGCAAGGG GAAAGTTTTA A
 
Protein sequence
MAKLWGGRFE KESDHLMEDF HSSISFDQRL YKQDIAGSMA HARMLGKAGI ISKAEAERIV 
AGLQEILADI EAGKIEFSVA AEDIHMNIEE LLTQRTGEVG KKLHTARSRN DQVALDVRMY
LKEEITEVMN LIKYLQDTLA ELAEEHLDTV LPGYTHLQRA QPVTLAHHLM AYYQMFSRDL
DRLGDCYRRT DVMPLGSGAL AGTTFALDRQ YVAEQLGFAR ISENSLDAVA DRDFAVEFAS
AASLIMMHLS RFCEEIILWS TAEFAFIELD DAYSTGSSMM PQKKNPDVAE LIRGKTGRVY
GDLQALLTML KGLPLAYNKD MQEDKEALFD AVDTVKKCLM LFRPMLATVK VKKENMARAA
RGGFTNATDL ADYLVYKGVP FRQAHEIAGR LVLYCLAKKK TLEEVSLGEY REFSDLIAED
IYQAIDINHC VEARKVYGGP ARAVVQEAIN RARGKF
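
The protein sequence can be reproduced from the gene sequence by translating it codon by codon, and the GC content can be recomputed from the same string. The following is a minimal Python sketch, not the database's own pipeline. It assumes the sequence is pasted in with its 10-base spacing, and it uses the 64 codon assignments that translation table 11 shares with the standard code. Table 11 differs from the standard code in which start codons it permits, which does not matter here because this gene starts with ATG. All function names are illustrative.

```python
# Codon assignments shared by the standard code and translation table 11,
# laid out in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGGCCAAGC TTTGGGGTGG TCGTTTTGAA AAAGAATCGG ATCACTTAAT GGAGGATTTT"
print(translate(first_line))  # MAKLWGGRFEKESDHLMEDF
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content listed in this record.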