Gene Dtox_2352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2352 
Symbol 
ID8429336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2523013 
End bp2526237 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content46% 
IMG OID645034657 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_003191786 
Protein GI258515564 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.374511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTAA AAGAGGGTCT GAAAAAAGTG ATGGTCATCG GTTCCGGCCC GATCATCATC 
GGCCAGGCAG CCGAGTTTGA TTATGCAGGT ACCCAGGCTT GCCGGGCTTT GCGCGAAGAG
GGTTTGGAAG TTGTGCTGGT CAATTCTAAT CCGGCTACCA TTATGACAGA CAGCAATATG
GCTGACCGGA TATATATTGA TCCCCTGACA CCGGAATTTG TGACCAAGGT TCTGAAGAAA
GAGAAGCCGG ACGGTCTCCT GCCCACTCTG GGCGGACAGG TAGGTTTAAA TATGGCTTTG
CAGTTGGCTA AATCGGGTGT TTTGGAGGAA ACCGGAGTTC AACTGCTGGG CACACCGCTG
GAAAGTATCA TCAAGGCGGA AGACAGGGAA GGCTTTAAAT CGATGATGCA GAGCATTAAT
GAGCCTATAC CGGAGAGTGA CATTGTTTCC AGCGTGGAGG ATGCTGTTAA GTTTGCCGAG
AGAGTCGGTT TCCCGTTGGT TGTGCGTCCT GCTTATACTC TTGGCGGCAC GGGCGGCGGA
ATGGTTTATA ATATGGCTGA GTTAAAGAGT ACTGCCACCA GAGGCATCAG GCACAGTATT
ATTAACCAGA TACTGGTAGA ACGCAGTTTA GTTGGCTGGA AGGAAATTGA GTTTGAGGTA
ATGAGAGACA GTATGGATAA CTGTATAACC ATATGCAGTA TGGAAAACAT TGATCCTATG
GGTATCCATA CCGGTGACAG CATTGTTGTG GCACCCTGTC AGACTTTAAG TGATAAGGAA
TATCAGATGC TGAGATCAGC CTCACTAAAA ATTATCAGGG CTCTGGGTGT GCATGGCGGT
TGTAATGTGC AGTATGCTTT GGACCCCGAC AGTTTCCAGT ACTATGTTAT TGAAGTTAAC
CCGCGAGTTT CCCGTTCTTC CGCTCTGGCT TCCAAAGCTA CTGGTTACCC GATTGCCAAG
GTGTCAGCCA AGATTGCCGT CGGACTGACA CTGGATGAAA TTAAAAATGC AGTTACCGGC
AAGACTTATG CTTGTTTTGA GCCGACTATT GATTACGTGG TGTTCAAGTT TCCCCGCTGG
CCTTTTGATA AGTTTGCCCT GGCTGACAGG CAACTGGGTA CTCAAATGAA GGCTACCGGT
GAGGTAATGT CTATTGATCG GACTCTGGAA GGGGCCATTC TCAAAGCTGT ACGTTCTCTG
GAAATCGGTT CACCCGGTCT GGCAGTTGAG GGAGCGGAGG CTTATACAGA GGAGCTTTTG
GAAAGCAAGC TCTCTCATCC TGAGGATGAG AGACTTTTCC TGGTTGCGGA AGCTTTCAGA
CGGGGTATGC TGATTGACCG GGTTCATGAA TTAACCAAGA TTGATCGATT TTTTCTGGAT
AAAATATATA ATATAGTCAA AATGGAAAAG GAAATTAAGC TGGTTAAAGG GAATATTAAC
AGGCTCAAAC CTGAACTCTT GACCAGAGCT AAAAAAATGG GGTTTGCTGA TGTCTATCTG
GCCAAACTGG CTGGAGTGGA TTATGAGTTT TTGAGATCTT ATCGCAAGGA ACTGAATATT
GTCCCTTCCT ACAAAATGGT TGATACCTGC GCGGCTGAAT TTGAAGCGGA AACTCCTTAT
TTTTATTCTT CCTATGATCA GGAGAACGAA GCTTTGCCGT CGGAAAAAAG AAAAATCGTG
GTGCTCGGTT CCGGCCCGAT TCGCATTGGC CAGGGTATAG AGTTTGATTA CTGCTCGGTT
CATTCCGTTT GGGCTCTGCG GGAGGCAGGA ATTGAAGCCA TTATTATCAA CAATAACCCG
GAAACTGTCA GCACTGACTT TGATACGGCG GATCGCTTAT ATTTTGAGCC TATGCTGCCG
GAAGATGTAT TAAACATACT GGAGAATGAA AAACCGGAAG GTGTAATTGT GCAGTTTGGC
GGGCAGACAG CTATAAATCT GGCTAAGCCT CTGGAAAATG CCGGTATCAA AATATTGGGA
ACATCTGTGG AAAACATAGA CAGGGCAGAA GACAGAGAGC GCTTCGACCG TCTCTTAAGC
AAACTTAATA TTCCCCGGCC GGCCGGCAAT ACCGTTTTTT CAGTTACTGA AGCTATCGGT
GTGGCTGACG AAATAGGCTA CCCGGTGCTG GTGCGTCCTT CCTATGTGCT GGGTGGCAGG
GCTATGGAGA TAGTTTATAA TGAAATTGAT TTATTGAACT ATATGGCTTC AGCAGTTAAG
GTAACTCCTG AACACCCTGT ACTTGTTGAT AAGTACCTTT GCGGCAAAGA GCTGGAGGTA
GACGCTATCT CTGACGGCAA GGATGTCTTA ATTCCGGGTA TTATGGAGCA TATTGAGCGT
GCGGGAGTGC ATTCGGGCGA CAGTATAGCT GTCTATCCCC CGCGCAATCT GTCTGACAGA
ATCAGAGGTT TGCTAGTGGA TTATACCACC AGGTTGGCTA AAGAGCTGAA TGTCAAAGGA
GTAATCAATA TCCAGTATGT GCTGCATGAG GATCAGTTGT ATGTATTGGA AGTAAACCCT
CGTTCCAGCC GTACCGTACC CTATATGAGC AAGATTACAG GCATACCCAT GGTCAATTTG
GCTACAAAGA TAATTTTAGG GCAAACTCTG GCTGATTTGG GTTATTCAGC CGGTTTGTAT
CCTGAATCCA AGTTTGTAGG TGTCAAGGTG CCTGTATTCT CCTTTGGCAA ATTGCTTCAG
GTGGATATCT CCTTGGGGCC TGAAATGAAA TCTACCGGTG AAGTTATGGG TATGGATAAG
AATTACTGTA TAGCAACATA CAAGGCTTTC GTGGCGGCAG GCTATGATTT TCCCAAACAC
GGTACTATTT TGGTCACAGT AGCCGATAAG GATAAAGCTG AGGCACTGCC TATTATCAAA
GGCCTGGCCG GTTTAGGCTA TAAAATATGT GCTACCAGAG GGACTGCTGA TTTTCTGGCC
AGTGAAGGCC TTTCCGTTGA GTCTGTCAAT AAGGTTCATG AGGCTTCTCC CAATATTATT
GATTTAATCA GGCAAAACTC CATCCACCTG GTAATCAATA CTTTAACCAA GGGTAAAGCT
CCGGAGCGTG ACGGCTTCAG AATACGCAGG ACAGCTGTCG AACACGGTGT TCCCTGCCTG
ACTTCTCTGG ATACGGCGCG TGCTATATAT GAAGTTCTGG GCGCGATAAA AGTTGGTGGG
GATATAGAAC TTTTACCGCT CCAGGAGTAT ATGAAGAATA AATAG
 
Protein sequence
MPLKEGLKKV MVIGSGPIII GQAAEFDYAG TQACRALREE GLEVVLVNSN PATIMTDSNM 
ADRIYIDPLT PEFVTKVLKK EKPDGLLPTL GGQVGLNMAL QLAKSGVLEE TGVQLLGTPL
ESIIKAEDRE GFKSMMQSIN EPIPESDIVS SVEDAVKFAE RVGFPLVVRP AYTLGGTGGG
MVYNMAELKS TATRGIRHSI INQILVERSL VGWKEIEFEV MRDSMDNCIT ICSMENIDPM
GIHTGDSIVV APCQTLSDKE YQMLRSASLK IIRALGVHGG CNVQYALDPD SFQYYVIEVN
PRVSRSSALA SKATGYPIAK VSAKIAVGLT LDEIKNAVTG KTYACFEPTI DYVVFKFPRW
PFDKFALADR QLGTQMKATG EVMSIDRTLE GAILKAVRSL EIGSPGLAVE GAEAYTEELL
ESKLSHPEDE RLFLVAEAFR RGMLIDRVHE LTKIDRFFLD KIYNIVKMEK EIKLVKGNIN
RLKPELLTRA KKMGFADVYL AKLAGVDYEF LRSYRKELNI VPSYKMVDTC AAEFEAETPY
FYSSYDQENE ALPSEKRKIV VLGSGPIRIG QGIEFDYCSV HSVWALREAG IEAIIINNNP
ETVSTDFDTA DRLYFEPMLP EDVLNILENE KPEGVIVQFG GQTAINLAKP LENAGIKILG
TSVENIDRAE DRERFDRLLS KLNIPRPAGN TVFSVTEAIG VADEIGYPVL VRPSYVLGGR
AMEIVYNEID LLNYMASAVK VTPEHPVLVD KYLCGKELEV DAISDGKDVL IPGIMEHIER
AGVHSGDSIA VYPPRNLSDR IRGLLVDYTT RLAKELNVKG VINIQYVLHE DQLYVLEVNP
RSSRTVPYMS KITGIPMVNL ATKIILGQTL ADLGYSAGLY PESKFVGVKV PVFSFGKLLQ
VDISLGPEMK STGEVMGMDK NYCIATYKAF VAAGYDFPKH GTILVTVADK DKAEALPIIK
GLAGLGYKIC ATRGTADFLA SEGLSVESVN KVHEASPNII DLIRQNSIHL VINTLTKGKA
PERDGFRIRR TAVEHGVPCL TSLDTARAIY EVLGAIKVGG DIELLPLQEY MKNK