Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B4727 |
Symbol | |
ID | 6797067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 4618802 |
End bp | 4619935 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642778800 |
Product | dihydroorotase |
Protein accession | YP_002149362 |
Protein GI | 197249655 |
COG category | [R] General function prediction only |
COG ID | [COG3964] Predicted amidohydrolase |
TIGRFAM ID | [TIGR03583] probable amidohydrolase EF_0837/AHA_3915 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGATT TACTCCTGCG CCATGCGCGT CTGGTCGATG ACACGCTGAC TAATATTGCC CTGCAAGATG GCAAAATCGC GGCGTTGGGT GACGTTGATG GTCCGGCGCT GAAAACCATT GACCTGCGCG GCGAGTGTTA CGTTAGTGCG GGTTGGATTG ATTCTCATGT TCACTGCTAC CCGACATCAC CGATTTATCA CGACGAACCG GACAGCGTGG GTATTGCCAC TGGCGTTACC ACAGTGGTGG ATGCCGGTAG CACTGGCGCA GACGACATTG ATGATTTCTA TGCTCTGACG CGTCAGGCGA CCACCGACGT TTATGCGCTG CTGAATGTTT CACGCGTTGG GCTTATTGCC CAAAACGAGC TGGCTAACAT GGCCAATATT GACGCCGATG CGGTCCGGCA GGCGGTAAAA CGCCATCCGG ATTTTATCGT CGGCCTCAAG GCGCGGATGA GCAGCAGCGT GGTAGGCGTT AACGGCATCA CGCCGCTGGA ACACGCTAAA GCCATGCAGC AAGAAAACGG CAACCTACCG CTGATGGTGC ATATTGGCAA TAACCCGCCG GATCTGGACG AAATCGCGGA GCGTCTGACG GCGGGCGATA TCATTACCCA CTGTTACAAC GGTAAGCCGA ACCGTATTCT GACGCCGGAA GGCGAGCTGC GCGCCTCGGT GACACGAGCG CTGGCGCGCG GCGTGCGTCT GGACGTTGGA CATGGTACCG CCAGCCTGAG CTTTGCAGTG GCGAAACGCG CTATTAGCCT GGGGATTTTA CCGCATACCA TCAGTTCCGA TATCTACTGC CGTAACCGCA TCAATGGCCC GGTGCATTCG CTGGCTAATG TGATGTCGAA ATTCCTCGCC ATCGGCATGT CGCTGCCGCA GGTCATTGCG TGCGTGACGG CCAATGCCGC CGATAGCCTG AATCTGAAAA CCAAAGGTCG TCTTCAGCCT GGTCTGGATG CCGACCTGAC CCTCTTTACG CTTAAACGCC AGCCCACCGT GTTGGTAGAC GCGGAACACG ACAGCTTACA GGCTGAAGAA TTGCTGACGC CGCTTGCCGC GATACGCGCA GGCAAGGGCT ATATGACCGA AAAAGGGAGC GCGGAACATG CCTTCGATTT TTGA
|
Protein sequence | MFDLLLRHAR LVDDTLTNIA LQDGKIAALG DVDGPALKTI DLRGECYVSA GWIDSHVHCY PTSPIYHDEP DSVGIATGVT TVVDAGSTGA DDIDDFYALT RQATTDVYAL LNVSRVGLIA QNELANMANI DADAVRQAVK RHPDFIVGLK ARMSSSVVGV NGITPLEHAK AMQQENGNLP LMVHIGNNPP DLDEIAERLT AGDIITHCYN GKPNRILTPE GELRASVTRA LARGVRLDVG HGTASLSFAV AKRAISLGIL PHTISSDIYC RNRINGPVHS LANVMSKFLA IGMSLPQVIA CVTANAADSL NLKTKGRLQP GLDADLTLFT LKRQPTVLVD AEHDSLQAEE LLTPLAAIRA GKGYMTEKGS AEHAFDF
|
| |