Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_2352 |
Symbol | |
ID | 8429336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 2523013 |
End bp | 2526237 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645034657 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003191786 |
Protein GI | 258515564 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.374511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTAA AAGAGGGTCT GAAAAAAGTG ATGGTCATCG GTTCCGGCCC GATCATCATC GGCCAGGCAG CCGAGTTTGA TTATGCAGGT ACCCAGGCTT GCCGGGCTTT GCGCGAAGAG GGTTTGGAAG TTGTGCTGGT CAATTCTAAT CCGGCTACCA TTATGACAGA CAGCAATATG GCTGACCGGA TATATATTGA TCCCCTGACA CCGGAATTTG TGACCAAGGT TCTGAAGAAA GAGAAGCCGG ACGGTCTCCT GCCCACTCTG GGCGGACAGG TAGGTTTAAA TATGGCTTTG CAGTTGGCTA AATCGGGTGT TTTGGAGGAA ACCGGAGTTC AACTGCTGGG CACACCGCTG GAAAGTATCA TCAAGGCGGA AGACAGGGAA GGCTTTAAAT CGATGATGCA GAGCATTAAT GAGCCTATAC CGGAGAGTGA CATTGTTTCC AGCGTGGAGG ATGCTGTTAA GTTTGCCGAG AGAGTCGGTT TCCCGTTGGT TGTGCGTCCT GCTTATACTC TTGGCGGCAC GGGCGGCGGA ATGGTTTATA ATATGGCTGA GTTAAAGAGT ACTGCCACCA GAGGCATCAG GCACAGTATT ATTAACCAGA TACTGGTAGA ACGCAGTTTA GTTGGCTGGA AGGAAATTGA GTTTGAGGTA ATGAGAGACA GTATGGATAA CTGTATAACC ATATGCAGTA TGGAAAACAT TGATCCTATG GGTATCCATA CCGGTGACAG CATTGTTGTG GCACCCTGTC AGACTTTAAG TGATAAGGAA TATCAGATGC TGAGATCAGC CTCACTAAAA ATTATCAGGG CTCTGGGTGT GCATGGCGGT TGTAATGTGC AGTATGCTTT GGACCCCGAC AGTTTCCAGT ACTATGTTAT TGAAGTTAAC CCGCGAGTTT CCCGTTCTTC CGCTCTGGCT TCCAAAGCTA CTGGTTACCC GATTGCCAAG GTGTCAGCCA AGATTGCCGT CGGACTGACA CTGGATGAAA TTAAAAATGC AGTTACCGGC AAGACTTATG CTTGTTTTGA GCCGACTATT GATTACGTGG TGTTCAAGTT TCCCCGCTGG CCTTTTGATA AGTTTGCCCT GGCTGACAGG CAACTGGGTA CTCAAATGAA GGCTACCGGT GAGGTAATGT CTATTGATCG GACTCTGGAA GGGGCCATTC TCAAAGCTGT ACGTTCTCTG GAAATCGGTT CACCCGGTCT GGCAGTTGAG GGAGCGGAGG CTTATACAGA GGAGCTTTTG GAAAGCAAGC TCTCTCATCC TGAGGATGAG AGACTTTTCC TGGTTGCGGA AGCTTTCAGA CGGGGTATGC TGATTGACCG GGTTCATGAA TTAACCAAGA TTGATCGATT TTTTCTGGAT AAAATATATA ATATAGTCAA AATGGAAAAG GAAATTAAGC TGGTTAAAGG GAATATTAAC AGGCTCAAAC CTGAACTCTT GACCAGAGCT AAAAAAATGG GGTTTGCTGA TGTCTATCTG GCCAAACTGG CTGGAGTGGA TTATGAGTTT TTGAGATCTT ATCGCAAGGA ACTGAATATT GTCCCTTCCT ACAAAATGGT TGATACCTGC GCGGCTGAAT TTGAAGCGGA AACTCCTTAT TTTTATTCTT CCTATGATCA GGAGAACGAA GCTTTGCCGT CGGAAAAAAG AAAAATCGTG GTGCTCGGTT CCGGCCCGAT TCGCATTGGC CAGGGTATAG AGTTTGATTA CTGCTCGGTT CATTCCGTTT GGGCTCTGCG GGAGGCAGGA ATTGAAGCCA TTATTATCAA CAATAACCCG GAAACTGTCA GCACTGACTT TGATACGGCG GATCGCTTAT ATTTTGAGCC TATGCTGCCG GAAGATGTAT TAAACATACT GGAGAATGAA AAACCGGAAG GTGTAATTGT GCAGTTTGGC GGGCAGACAG CTATAAATCT GGCTAAGCCT CTGGAAAATG CCGGTATCAA AATATTGGGA ACATCTGTGG AAAACATAGA CAGGGCAGAA GACAGAGAGC GCTTCGACCG TCTCTTAAGC AAACTTAATA TTCCCCGGCC GGCCGGCAAT ACCGTTTTTT CAGTTACTGA AGCTATCGGT GTGGCTGACG AAATAGGCTA CCCGGTGCTG GTGCGTCCTT CCTATGTGCT GGGTGGCAGG GCTATGGAGA TAGTTTATAA TGAAATTGAT TTATTGAACT ATATGGCTTC AGCAGTTAAG GTAACTCCTG AACACCCTGT ACTTGTTGAT AAGTACCTTT GCGGCAAAGA GCTGGAGGTA GACGCTATCT CTGACGGCAA GGATGTCTTA ATTCCGGGTA TTATGGAGCA TATTGAGCGT GCGGGAGTGC ATTCGGGCGA CAGTATAGCT GTCTATCCCC CGCGCAATCT GTCTGACAGA ATCAGAGGTT TGCTAGTGGA TTATACCACC AGGTTGGCTA AAGAGCTGAA TGTCAAAGGA GTAATCAATA TCCAGTATGT GCTGCATGAG GATCAGTTGT ATGTATTGGA AGTAAACCCT CGTTCCAGCC GTACCGTACC CTATATGAGC AAGATTACAG GCATACCCAT GGTCAATTTG GCTACAAAGA TAATTTTAGG GCAAACTCTG GCTGATTTGG GTTATTCAGC CGGTTTGTAT CCTGAATCCA AGTTTGTAGG TGTCAAGGTG CCTGTATTCT CCTTTGGCAA ATTGCTTCAG GTGGATATCT CCTTGGGGCC TGAAATGAAA TCTACCGGTG AAGTTATGGG TATGGATAAG AATTACTGTA TAGCAACATA CAAGGCTTTC GTGGCGGCAG GCTATGATTT TCCCAAACAC GGTACTATTT TGGTCACAGT AGCCGATAAG GATAAAGCTG AGGCACTGCC TATTATCAAA GGCCTGGCCG GTTTAGGCTA TAAAATATGT GCTACCAGAG GGACTGCTGA TTTTCTGGCC AGTGAAGGCC TTTCCGTTGA GTCTGTCAAT AAGGTTCATG AGGCTTCTCC CAATATTATT GATTTAATCA GGCAAAACTC CATCCACCTG GTAATCAATA CTTTAACCAA GGGTAAAGCT CCGGAGCGTG ACGGCTTCAG AATACGCAGG ACAGCTGTCG AACACGGTGT TCCCTGCCTG ACTTCTCTGG ATACGGCGCG TGCTATATAT GAAGTTCTGG GCGCGATAAA AGTTGGTGGG GATATAGAAC TTTTACCGCT CCAGGAGTAT ATGAAGAATA AATAG
|
Protein sequence | MPLKEGLKKV MVIGSGPIII GQAAEFDYAG TQACRALREE GLEVVLVNSN PATIMTDSNM ADRIYIDPLT PEFVTKVLKK EKPDGLLPTL GGQVGLNMAL QLAKSGVLEE TGVQLLGTPL ESIIKAEDRE GFKSMMQSIN EPIPESDIVS SVEDAVKFAE RVGFPLVVRP AYTLGGTGGG MVYNMAELKS TATRGIRHSI INQILVERSL VGWKEIEFEV MRDSMDNCIT ICSMENIDPM GIHTGDSIVV APCQTLSDKE YQMLRSASLK IIRALGVHGG CNVQYALDPD SFQYYVIEVN PRVSRSSALA SKATGYPIAK VSAKIAVGLT LDEIKNAVTG KTYACFEPTI DYVVFKFPRW PFDKFALADR QLGTQMKATG EVMSIDRTLE GAILKAVRSL EIGSPGLAVE GAEAYTEELL ESKLSHPEDE RLFLVAEAFR RGMLIDRVHE LTKIDRFFLD KIYNIVKMEK EIKLVKGNIN RLKPELLTRA KKMGFADVYL AKLAGVDYEF LRSYRKELNI VPSYKMVDTC AAEFEAETPY FYSSYDQENE ALPSEKRKIV VLGSGPIRIG QGIEFDYCSV HSVWALREAG IEAIIINNNP ETVSTDFDTA DRLYFEPMLP EDVLNILENE KPEGVIVQFG GQTAINLAKP LENAGIKILG TSVENIDRAE DRERFDRLLS KLNIPRPAGN TVFSVTEAIG VADEIGYPVL VRPSYVLGGR AMEIVYNEID LLNYMASAVK VTPEHPVLVD KYLCGKELEV DAISDGKDVL IPGIMEHIER AGVHSGDSIA VYPPRNLSDR IRGLLVDYTT RLAKELNVKG VINIQYVLHE DQLYVLEVNP RSSRTVPYMS KITGIPMVNL ATKIILGQTL ADLGYSAGLY PESKFVGVKV PVFSFGKLLQ VDISLGPEMK STGEVMGMDK NYCIATYKAF VAAGYDFPKH GTILVTVADK DKAEALPIIK GLAGLGYKIC ATRGTADFLA SEGLSVESVN KVHEASPNII DLIRQNSIHL VINTLTKGKA PERDGFRIRR TAVEHGVPCL TSLDTARAIY EVLGAIKVGG DIELLPLQEY MKNK
|
| |