Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A3527 |
Symbol | |
ID | 6517997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 3402211 |
End bp | 3403491 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642748513 |
Product | cytosine deaminase |
Protein accession | YP_002116283 |
Protein GI | 194737042 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAACA ATAACATCAC CATTCTTCAG GCGCGTCTGC AGGGACACGA AGGATTATGG CAGATTACGA TTGAAAACGG GCGCTTTAGC CGGATTGAGC CTCAGGAAGC CGCATCGTTA CCGCAGGGCG AAGTGCTTGA TGCCGAAGGC GGTCTGGCTA TTCCACCGTT TGTTGAGCCA CATATCCACC TGGATACCAC GCAAACGGCG GGTGAACCGA GCTGGAACCA GTCCGGCACA CTGTTTGAAG GGATTGAGCG CTGGGCAGAA CGTAAAGCAA TGCTCACGCA TGAAGATGTG AAAGCGCGCG CGATGCAGAC CCTGAAATGG CAGATGGCTA ACGGCATCCA GTACGTTCGT ACTCACGTTG ACGTTTCAGA TCCTACGCTC ACCGCGCTGA AAGCCATGCT GGAAGTGAAG CAAGAGGTTG CGCCGTGGGT AGACCTACAA ATCGTCGCCT TTCCGCAAGA AGGTATTCTG TCTTATCCCA ACGGTGAAGC GCTGTTAGAG GAAGCCGTGC GTTTGGGCGC TGACGTTATT GGTGCCATTC CGCACTTTGA GTTCACGCGC GAATATGGCG TTGAATCGCT GCACAAAATC TTTGCACTGG CGCAGAAATA CGATCGCTTG ATCGATGTGC ACTGCGACGA AATTGATGAC GAGCAGTCTC GCTTTGTCGA AACGGTCGCG GCGCTGGCGC ATCGTGATGG CATGGGGGCG CGCGTGACCG CCAGCCACAC CACGGCTATG CACTCGTACA ACGGCGCGTA TGCGTCACGC CTGTTCCGCC TGTTGAAAAT GTCGGGGATT AACTTCGTCG CCAACCCGCT GGTGAATATT CATCTGCAAG GGCGGTTTGA CACTTACCCG AAACGTCGTG GCGTCACGCG GGTGAAAGAG ATGCTGGAAG CGGGGATCAA CGTCTGCTTT GGCCATGATG ACGTCTTCGA TCCGTGGTAT CCATTAGGCA CGGCAAATAT GCTGCAGGTG CTGCATATGG GGTTACACGT TTGCCAGTTG ATGGGGTATG GGCAAATCAA CGATGGGTTA AATCTGATCA CTACCCACAG CGCAAAAACC CTGCATTTAC AGGACTACAG TCTGAGCGTC GGCAATGCCG CGAATCTGGT TATCTTACCC GCTGAGAATG GATTCGATGC GGTACGTCGC CAGACGCCTG CCCGTTACTC GATTCGCCAC GGGCGGGTAA TTGCCGAGAC GGTGCCGAGC CAGACGACGC TGCACCTGCC CCAGCCGGAA GCCGTGACGT TTAAGCGTTA A
|
Protein sequence | MQNNNITILQ ARLQGHEGLW QITIENGRFS RIEPQEAASL PQGEVLDAEG GLAIPPFVEP HIHLDTTQTA GEPSWNQSGT LFEGIERWAE RKAMLTHEDV KARAMQTLKW QMANGIQYVR THVDVSDPTL TALKAMLEVK QEVAPWVDLQ IVAFPQEGIL SYPNGEALLE EAVRLGADVI GAIPHFEFTR EYGVESLHKI FALAQKYDRL IDVHCDEIDD EQSRFVETVA ALAHRDGMGA RVTASHTTAM HSYNGAYASR LFRLLKMSGI NFVANPLVNI HLQGRFDTYP KRRGVTRVKE MLEAGINVCF GHDDVFDPWY PLGTANMLQV LHMGLHVCQL MGYGQINDGL NLITTHSAKT LHLQDYSLSV GNAANLVILP AENGFDAVRR QTPARYSIRH GRVIAETVPS QTTLHLPQPE AVTFKR
|
| |