Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B4714 |
Symbol | |
ID | 6794697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 4607102 |
End bp | 4609042 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642778787 |
Product | probable malonic semialdehyde oxidative decarboxylase |
Protein accession | YP_002149349 |
Protein GI | 197251105 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3962] Acetolactate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.163823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAA TCAGGTTGAC CATGGCGCAG GCTTTGGTGC GCTTTCTTGA TAATCAGTAC ATCGACGTAG ACGGCAGCGA AATCAAATTT GTAAAAGGGA TTTTCGCCAT TTTTGGCCAC GGGAACGTCG TCGGATTGGG GCAAGCGCTG GAAGAGGACT GTGGCCAACT TAGCGTTCAT CAAGGGCGTA ACGAACAGGG AATGGCGCAT ATCGCGACGG GATTTGCCCG CCAGATGCGT CGCCATCAGA TTTATGCCTG CACCTCCTCA GTGGGGCCAG GGGCCGCCAA TATGATCACC GCAGCGGCGA CCGCGACGGC TAACCGTATT CCACTGCTTT TGCTGCCGGG CGATGTGTAC GCATCTCGTC AACCCGACCC GGTTTTGCAA CAGGTTGAAC AAGAACACGA TCTGACGCTG AGCACCAATG ACGCTTTCCG TGCAGTTAGC CGCTACTGGG ATCGCATTAC GCGTCCGGAA CAGCTAATGA GCGCCTGTAT CAGCGCGATG CGGGTGTTAA CCGATCCGGC GGATACGGGG GCCGTGACGC TTTGCCTGCC ACAGGATGTG CAGGGTGAAG CCTGGGATTA TCCGGATTAT TTTTTCGCTC GCCGGGTCTA TCGTCTTGAG CGTCACGCGC CGACGGAGCC GATGCTGAAC GAGGCGGTTG CGCTGATTCG CCGCCACCAG CGGCCGCTGA TCGTTTGCGG TGGGGGAGTG AAGTATTCGC AGGCTGAAGA GGCGCTGCTG AGATTTGCCG AACGCTGTCA TCTGCCGATT GCTGAAACCC AGGCTGGCAA GGGAGCGCTC AGTTCTGCAC ACCCGTTGAA CGTCGGCGGG ATTGGCGAAA CCGGTTCACT GGCGGCGAAT CTGCTGGCGC AGGAAGCCGA TCTGATTATC GGTGTGGGGA CGCGCTATAC CGATTTCACC ACCTCCTCAA AGTGGATCTT CCAGAATCCC GACGTGCGCT ACTTAAATAT CAACGTTAGC CGCTTTGATG TCTTCAAGCT GGATGGCGTA CAGATGCAGG GTGACGCTCG CGTCGCCCTG ACGCAGCTTA GCGAACGGCT GGCCCAGGAG CATTATGCTT CGCAATGGGG TGAGACTATT CACCGCGTCC GCTCGCAATA TATGGCGGAA GTTGAGCGCG TCTATGCTGT GGAATATAGC GGAGAGGGCT TCAAACCTGA AATTGAGGAT CATATGGATA CTCAAAAGGT GTTTGAAGAG TTTAATGAGA TTACGCGATC GTGGCTGACC CAGACGCGCG TGTTGGGCGT GCTTAACCGG ATGTTGCCGG AAAACGCGCT GGTGGTGGCG GCGGCGGGCA GCCTGCCGGG CGACCTCCAG CGTGTCTGGC AAAGCCGCGG CGAGAATGAT TACCACGTCG AGTACGGCTA CTCCTGTATG GGCTACGAAG TCAATGCCGC ATTGGGGGCC AAGCTGGCGC AGCCGGAACG CGAGGTGTAC AGCTTCGTGG GCGACGGTTC GTTCATGATG CTGCACTCTG AGTTGGTCAC TTCCGTCCAG ATGGGGAAAA AGATTACCGT CATTTTGCTC GATAACATGA CCAACGGCTG TATCAATAAT CTGCAAATGG AACACGGTAT GGACAGTTAC TTCACCGAGT TTCGTTTCCA TCAGCAGGAG AGCGGTCGTC AGGAAGGCGG GTTTATCCCG GTCGATTTCG CTCGCATCGC TGAAGGATAT GGCTGTAAAA GCTATCGCGT CACCACCATT GAACAACTGC ATGAAGCGTT GGAAGATGCT CGTAAACAGA CCGTGAGTAC GCTGATAGAC ATAAAAGTGC TTCCCAAAAC GATGGTGCAT AAGTACCTGA GCTGGTGGCG CGTTGGTGGG GCGCAGGTAT CCCGTAGCGA ACGTATCCAG GCGATAGCGC GTATGCTTGA GGAACATATC GGACAGGCCC GGCAGTACTG A
|
Protein sequence | MKTIRLTMAQ ALVRFLDNQY IDVDGSEIKF VKGIFAIFGH GNVVGLGQAL EEDCGQLSVH QGRNEQGMAH IATGFARQMR RHQIYACTSS VGPGAANMIT AAATATANRI PLLLLPGDVY ASRQPDPVLQ QVEQEHDLTL STNDAFRAVS RYWDRITRPE QLMSACISAM RVLTDPADTG AVTLCLPQDV QGEAWDYPDY FFARRVYRLE RHAPTEPMLN EAVALIRRHQ RPLIVCGGGV KYSQAEEALL RFAERCHLPI AETQAGKGAL SSAHPLNVGG IGETGSLAAN LLAQEADLII GVGTRYTDFT TSSKWIFQNP DVRYLNINVS RFDVFKLDGV QMQGDARVAL TQLSERLAQE HYASQWGETI HRVRSQYMAE VERVYAVEYS GEGFKPEIED HMDTQKVFEE FNEITRSWLT QTRVLGVLNR MLPENALVVA AAGSLPGDLQ RVWQSRGEND YHVEYGYSCM GYEVNAALGA KLAQPEREVY SFVGDGSFMM LHSELVTSVQ MGKKITVILL DNMTNGCINN LQMEHGMDSY FTEFRFHQQE SGRQEGGFIP VDFARIAEGY GCKSYRVTTI EQLHEALEDA RKQTVSTLID IKVLPKTMVH KYLSWWRVGG AQVSRSERIQ AIARMLEEHI GQARQY
|
| |