Gene Information |
Locus tag | SeAg_B3525 |
Symbol | |
ID | 6795978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 3420290 |
End bp | 3421570 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642777656 |
Product | cytosine deaminase |
Protein accession | YP_002148258 |
Protein GI | 197247592 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
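
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (the reported gene length is consistent with that reading) and that the final codon of the CDS is a stop codon. The function names are hypothetical and exist only for this illustration.

```python
# Values copied from the record above.
START_BP = 3420290
END_BP = 3421570


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1281
print(protein_length(length_bp))  # 426
```

Both results agree with the Gene Length and Protein Length fields of the record.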
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAGAACA ATAACATCAC TATTCGTCAG ACGCGTCTGC AGGGACACGA AGGATTATGG CAGATTACGA TTGAAAACGG GCGCTTTAGC CGGATTGAGC CTCAGGAAGC CGCATCGTTA CCGCAGGGCG AAGTGCTTGA TGCCGAAGGC GGCCTGGCTA TTCCACCGTT TGTTGAGCCA CATATCCACC TGGATACCAC GCAAACGGCG GGTGAACCGA GCTGGAACCA GTCCGGCACA CTGTTTGAAG GGATTGAGCG CTGGGCAGAA CGTAAAGCAA TGCTCACGCA TGAAGATGTG AAAGCGCGCG CGATGCAGAC CCTGAAATGG CAGATGGCTA ACGGCATCCA GTACGTTCGT ACTCACGTTG ACGTTTCAGA TCCTACGCTC ACCGCGCTGA AAGCCATGCT GGAAGTGAAG CAAGAGGTTG CGCCGTGGGT AGATCTGCAA ATCGTCGCCT TTCCGCAAGA AGGTATTCTG TCTTATCCCA ACGGTGAAGC GCTGTTAGAG GAAGCCGTGC GTTTGGGCGC TGACGTTATT GGCGCCATTC CGCACTTTGA GTTCACGCGG GAATATGGCG TTGAATCGCT GCACAAAATC TTTGCACTGG CGCAGAAATA CGATCGCTTG ATCGATGTGC ACTGCGACGA AATTGATGAC GAGCAGTCTC GCTTTGTCGA AACGGTCGCG GCGCTGGCGC ATCGTGATGG CATGGGGGCG CGCGTGACCG CCAGCCACAC CACGGCTATG CACTCGTACA ACGGCGCGTA TGCGTCACGC CTGTTCCGCC TGTTGAAAAT GTCGGGGATT AACTTCGTCG CCAACCCGCT GGTGAATATT CATCTGCAAG GGCGGTTTGA CACTTACCCG AAACGTCGTG GCGTCACGCG AGTGAAAGAG ATGCTGGAAG CGGGGATCAA CGTCTGCTTT GGCCATGATG ACGTCTTCGA CCCGTGGTAT CCATTAGGCA CAGCAAATAT GCTGCAGGTG TTGCATATGG GGTTACACGT TTGCCAGTTG ATGGGGTATG GGCAAATCAA CGACGGGTTA AATCTGATCA CTACCCACAG CGCAAAAACC CTGCATTTAC AGGACTACGG TCTGAGCGTC GGCAATGCCG CGAATCTGGT TATCTTACCC GCTGAGAATG GATTCGATGC GGTACGTCGC CAGACGCCTG CCCGTTACTC GATTCGCCAC GGGCGGGTGA TTGCCGAGAC GGTGCCGAGC CAGACGACGC TGCACCTGAC CCAGCCGGAA GCCGTGACGT TTAAGCGTTA A
Protein sequence | MQNNNITIRQ TRLQGHEGLW QITIENGRFS RIEPQEAASL PQGEVLDAEG GLAIPPFVEP HIHLDTTQTA GEPSWNQSGT LFEGIERWAE RKAMLTHEDV KARAMQTLKW QMANGIQYVR THVDVSDPTL TALKAMLEVK QEVAPWVDLQ IVAFPQEGIL SYPNGEALLE EAVRLGADVI GAIPHFEFTR EYGVESLHKI FALAQKYDRL IDVHCDEIDD EQSRFVETVA ALAHRDGMGA RVTASHTTAM HSYNGAYASR LFRLLKMSGI NFVANPLVNI HLQGRFDTYP KRRGVTRVKE MLEAGINVCF GHDDVFDPWY PLGTANMLQV LHMGLHVCQL MGYGQINDGL NLITTHSAKT LHLQDYGLSV GNAANLVILP AENGFDAVRR QTPARYSIRH GRVIAETVPS QTTLHLTQPE AVTFKR
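
The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below shows one way to do that in plain Python, then compute GC content and translate the gene sequence with the amino acid assignments of translation table 11. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The sketch does not treat alternative start codons specially, which is acceptable here only because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGCAGAACA ATAACATCAC"))  # MQNNNI
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the Protein sequence and GC content fields of the record.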