Gene Information |
Locus tag | SeD_A3694 |
Symbol | |
ID | 6871716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3543913 |
End bp | 3545193 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642786670 |
Product | cytosine deaminase |
Protein accession | YP_002217304 |
Protein GI | 198241968 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
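The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def feature_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Protein length of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length_bp(3543913, 3545193)
print(gene_len)                     # 1281, matches "Gene Length"
print(protein_length_aa(gene_len))  # 426, matches "Protein Length"
```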
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAACA ATAACATCAC TATTCGTCAG ACGCGTCTGC AGGGACACGA AGGATTATGG CAGATTACGA TTGAAAACGG GCGCTTTAGC CGGATTGAGC CTCAGGAAGC CGCATCGTTA CCGCAGGGCG AAGTGCTTGA TGCCGAAGGC GGCCTGGCTA TTCCACCGTT TGTTGAGCCA CATATCCACC TGGATACCAC GCAAACGGCG GGTGAACCGA GCTGGAACCA GTCCGGCACA CTGTTTGAAG GGATTGAGCG CTGGGCAGAA CGTAAAGCAA TGCTCACGCA TGAAGATGTG AAAGCGCGCG CGATGCAGAC CCTGAAATGG CAGATGGCTA ACGGCATCCA GTACGTTCGT ACTCACGTTG ACGTTTCAGA TCCTACGCTC ACCGCGCTGA AAGCCATGCT GGAAGTGAAG CAAGAGGTTG CGCCGTGGGT AGACCTGCAA ATTGTCGCCT TTCCGCAAGA AGGTATTCTG TCTTATCCCA ACGGTGAAGC GCTGTTAGAG GAAGCCGTGC GTTTGGGCGT TGACGTTATT GGCGCCATTC CGCACTTTGA ATTTACGCGT GAATATGGCG TTGAATCGCT GCACAAAACC TTCGCCCTGG CGCAAAAATA CGATCGCTTG ATCGATGTGC ACTGCGACGA AATTGATGAC GAGCAGTCTC GCTTTGTCGA AACGGTCGCG GCGCTGGCGC ATCGCGATGG CATGGGGGCG CGCGTGACCG CCAGCCACAC CACGGCTATG CACTCGTACA ACGGCGCGTA TGCGTCACGC CTGTTCCGTC TGTTGAAAAT GTCGGGGATT AACTTCGTCG CCAACCCGCT GGTGAATATT CATCTGCAAG GACGGTTTGA CACTTACCCG AAACGTCGTG GCGTCACGCG AGTGAAAGAG ATGCTGGAAG CGGGGATCAA CGTCTGCTTT GGCCATGATG ACGTCTTCGA TCCGTGGTAT CCATTAGGCA CGGCAAATAT GCTGCAGGTG CTGCATATGG GGTTACACGT TTGCCAGTTG ATGGGGTATG GGCAAATCAA CGACGGGTTA AATCTGATCA CTACCCACAG CGCAAAAACC CTGCATTTAC AGGACTACGG TCTGAGCGTC GGCAATGCCG CGAATCTGGT TATCTTACCC GCTGAGAATG GATTCGATGC GGTACGTCGC CAGACGCCTG CGCGTTACTC GATTCGCCAC GGGCGAGTGA TTGCCGAGAC GGTGCCGAGC CAGACGACGC TGCACCTGAC CCAGCCGGAA GCCGTGACGT TTAAGCGTTA A
|
Protein sequence | MQNNNITIRQ TRLQGHEGLW QITIENGRFS RIEPQEAASL PQGEVLDAEG GLAIPPFVEP HIHLDTTQTA GEPSWNQSGT LFEGIERWAE RKAMLTHEDV KARAMQTLKW QMANGIQYVR THVDVSDPTL TALKAMLEVK QEVAPWVDLQ IVAFPQEGIL SYPNGEALLE EAVRLGVDVI GAIPHFEFTR EYGVESLHKT FALAQKYDRL IDVHCDEIDD EQSRFVETVA ALAHRDGMGA RVTASHTTAM HSYNGAYASR LFRLLKMSGI NFVANPLVNI HLQGRFDTYP KRRGVTRVKE MLEAGINVCF GHDDVFDPWY PLGTANMLQV LHMGLHVCQL MGYGQINDGL NLITTHSAKT LHLQDYGLSV GNAANLVILP AENGFDAVRR QTPARYSIRH GRVIAETVPS QTTLHLTQPE AVTFKR
|
| |
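The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch in plain Python, not the method used to produce this record. It assumes the sequence is pasted as shown above, in space-separated blocks of ten bases, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). Alternative start codons are not handled specially here; that does not matter for this gene, which begins with ATG. All names are illustrative.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCAGAACA ATAACATCAC TATTCGTCAG"))  # MQNNNITIRQ
```

Passing the full gene sequence to `gc_content` and `translate` should allow comparison with the "GC content" and "Protein sequence" fields of the record.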