Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1557 |
Symbol | |
ID | 6793499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 1516275 |
End bp | 1518239 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642775796 |
Product | peptidase, U32 family |
Protein accession | YP_002146432 |
Protein GI | 197248351 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCAGC AACCTCACTA TCTCGAATTG TTAAGTCCGG CCCGTGACGC CGCAATTGCT CGCGAAGCGA TTTTGCATGG CGCAGATGCC GTCTACATCG GCGGACCCGG TTTTGGCGCA CGTCATAACG CCAGTAACAG TCTGCGCGAT ATCGCCGATC TGGTCCCGTT TGCTCACCGT TACGGCGCCA GGATTTTTGT CACGCTGAAT ACTATCCTGC ATGATGATGA GCTGGAGCCC GCGCAGCGGT TAATCACCGA TTTGTACAAC ACCGGGGTGG ATGCGCTGAT TGTGCAGGAT ATGGGCATTC TGGAACTGGA TATCCCGCCG ATTGAGCTTC ACGCCAGTAC ACAGTGTGAT ATTCGCAGCG TGGAAAAAGC GAAGTTTCTT GCCGATGTCG GTTTTTCACA GATTGTACTG GCGCGCGAGC TTAATTTGAG TCAGATAGCG GCTATTCATC AGGCTACTGA CGCCACGATT GAGTTCTTCA TTCATGGCGC GCTGTGTGTC GCTTATTCTG GGCAGTGTTA TATCTCTCAT GCGCAAACCG GGCGCAGCGC CAATCGGGGC GACTGTTCGC AGGCCTGTCG TTTACCGTAT ACGTTAAAAG ACGATCAGGG GCGGGTGGTC TCTTACGAAA AACATTTGCT ATCGATGAAA GATAACGACC AAACGGCTAA CCTCGGCGCG TTGATCGATG CAGGCGTACG TTCCTTCAAG ATTGAAGGGC GCTACAAAGA CATGAGCTAT GTCAAAAACA TCACCGCGCA TTATCGTCAG ATGCTGGACG CGATTATCGA GCAACGTGGC GATCTGGCGC GTGCATCGGT TGGTCGGACC GAGCATTTTT TTGTTCCCTC CACGGAGAAA ACCTTCCATC GCGGCAGCAC CGACTATTTT GTTAACGCGC GTAAAGGTGA TATTGGCGCA TTTGATTCAC CAAAATTTAT TGGCTTGCCG GTAGGCGAGG TGCTGAATGT GGCGAAGGAT TATCTCGACG TAGAAGCTAC GGAGCCGTTG GCGAATGGCG ATGGTCTGAA CGTGTTGATT AAGCGTGAAG TGGTGGGTTT TCGCGCCAAT ACGGTGGAGA AAACCGGTCA TAACCGCTAC CGCGTTTGGC CAAATGATAT GCCTGCCGAC CTGCATAAAG TCCGTCCGCA TCATCCGTTG AATCGTAATC TGGATCATAA CTGGCAGCAA GCGCTGACAA AAACCTCCAG TGAGCGCCGT GTGGCGGTTG ATATCATGCT GGGCGGCTGG CAGGAACAGC TTATTCTGAC GCTGACCAGT GAAGACGGTG TCTGCATCAC GCATACGCTT GACGGGGTAT TTGAGGAAGC CAATAACTCT GAAAAAGCGT TGAATAACCT AAAAGCCGGA CTGGCGAAGC TGGGACAGAC GCCTTACTAC GCGCGTGATA TGCAGGTGAC ATTACCGGCG GCGTTGTTCG TGCCAAATAG CCTGCTCAAT CAGTTCCGTC GGGAGGCGAT TGATATGCTT GACGCGGCGC GGCTGGCCCA TTATCAACGA GGTCGTCGGA AACCCGTGGC GCAGCCTGCG CCGGTCTATC CGCAAACGCA TCTCAGCTTT CTCGCTAATG TCTACAACCA CAAAGCGCGG GAATTTTATC ACCGTTACGG CGTACAATTG ATTGATGCGG CCTATGAGGC GCATCAGGAG AAGGGCGAGG TACCGGTCAT GATCACCAAA CACTGCCTGC GTTTTGCGTT CAACCTTTGT CCTAAGCAGG CGAAAGGAAA TATTAAGAGC TGGAAAGCCA CGCCGATGCA GTTGGTGCAT GGCGATGAGG TACTGACGCT AAAATTCGAC TGCCGCCCTT GCGAAATGCA TGTCATTGGC AAAATTAAAA ACCACCTCTT AAAAATGCCC CAGCCCGGCA GCGTTGTCGC TTCAGTGAGC CCTGAAGCGC TGATGAAAAC GCTGCCGAAG CGCAGGGGCG TTTAA
|
Protein sequence | MRQQPHYLEL LSPARDAAIA REAILHGADA VYIGGPGFGA RHNASNSLRD IADLVPFAHR YGARIFVTLN TILHDDELEP AQRLITDLYN TGVDALIVQD MGILELDIPP IELHASTQCD IRSVEKAKFL ADVGFSQIVL ARELNLSQIA AIHQATDATI EFFIHGALCV AYSGQCYISH AQTGRSANRG DCSQACRLPY TLKDDQGRVV SYEKHLLSMK DNDQTANLGA LIDAGVRSFK IEGRYKDMSY VKNITAHYRQ MLDAIIEQRG DLARASVGRT EHFFVPSTEK TFHRGSTDYF VNARKGDIGA FDSPKFIGLP VGEVLNVAKD YLDVEATEPL ANGDGLNVLI KREVVGFRAN TVEKTGHNRY RVWPNDMPAD LHKVRPHHPL NRNLDHNWQQ ALTKTSSERR VAVDIMLGGW QEQLILTLTS EDGVCITHTL DGVFEEANNS EKALNNLKAG LAKLGQTPYY ARDMQVTLPA ALFVPNSLLN QFRREAIDML DAARLAHYQR GRRKPVAQPA PVYPQTHLSF LANVYNHKAR EFYHRYGVQL IDAAYEAHQE KGEVPVMITK HCLRFAFNLC PKQAKGNIKS WKATPMQLVH GDEVLTLKFD CRPCEMHVIG KIKNHLLKMP QPGSVVASVS PEALMKTLPK RRGV
|
| |