Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1718 |
Symbol | |
ID | 6486517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 1689988 |
End bp | 1691952 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642737098 |
Product | peptidase, U32 family |
Protein accession | YP_002040850 |
Protein GI | 194444629 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.378979 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCAGC AACCTCACTA TCTCGAATTG TTAAGTCCGG CCCGTGACGC CGCAATTGCT CGCGAAGCGA TTTTGCATGG CGCAGATGCC GTCTACATCG GCGGACCCGG TTTTGGCGCA CGTCATAACG CCAGTAACAG TCTGCGCGAT ATCGCCGATC TGGTTCCGTT TGCTCACCGT TACGGCGCCA GGATTTTTGT CACGCTGAAT ACTATTCTGC ATGATGATGA GCTGGAGCCC GCGCAGCGGT TAATCACCGA TTTGTACAAC ACCGGGGTGG ATGCGCTGAT TGTGCAGGAT ATGGGCATTC TGGAACTGGA TATCCCGCCG ATTGAGCTTC ACGCCAGTAC ACAGTGTGAT ATTCGCAGCG TGGAAAAAGC GAAGTTTCTT GCCGATGTCG GTTTTTCACA GATTGTACTG GCGCGCGAAC TTAATTTGAG TCAGATAGCG GCTATTCATC AGGCTACTGA CGCCACAATT GAGTTCTTCA TTCATGGCGC GCTGTGTGTC GCTTATTCTG GGCAGTGTTA TATCTCTCAT GCGCAAACCG GGCGCAGCGC CAATCGGGGC GACTGTTCGC AGGCCTGTCG TTTACCGTAT ACGTTAAAAG ACGATCAGGG GCGGGTGGTC TCTTACGAAA AACATTTGCT ATCGATGAAA GATAACGACC AAACGGCTAA CCTCGGCGCG TTGATCGATG CAGGCGTACG TTCCTTCAAG ATTGAAGGGC GCTATAAAGA CATGAGCTAT GTCAAAAACA TCACCGCGCA TTATCGTCAG ATGCTGGACG CGATTATCGA GCAACGTGGC GATCTGGCGC GTGCGTCGGT TGGTCGGACC GAACACTTTT TTGTTCCCTC CACGGAGAAA ACCTTCCATC GCGGCAGCAC CGACTATTTT GTTAACGCGC GTAAAGGTGA TATTGGCGCA TTTGATTCAC CAAAATTTAT TGGCTTGCCG GTAGGTGAGG TGCTGAATGT GGCGAAGGAT TATCTCGACG TAGAAGCTAC GGAGCCGTTG GCGAATGGCG ATGGTCTGAA CGTGTTGATT AAGCGTGAAG TGGTAGGTTT TCGCGCCAAT ACGGTGGAGA AAACCGGTCA TAACCGCTAC CGCGTTTGGC CAAATGATAT GCCTGCCGAC CTGCATAAAG TCCGTCCGCA TCATCCGTTG AATCGTAATC TGGATCATAA CTGGCAGCAA GCGCTGACAA AAACCTCCAG TGAGCGCCGT GTGGCGGTTG ATATCATGCT GGGCGGCTGG CAGGAACAGC TTATTCTGAC GCTGACCAGT GAAGACGGTG TCTGCATCAC GCATACGCTT GATGGGGTAT TTGAGGAAGC CAACAACGCT GAAAAAGCGT TGAATAACCT AAAAGCCGGA CTGGCGAAGC TGGGACAGAC GCCTTACTAC GCGCGTGATA TGCAGGTGAC ATTACCGGCG GCGTTGTTCG TGCCAAATAG CCTGCTCAAT CAGTTCCGTC GGGAGGCGAT TGATATGCTT GACGCGGCGC GGCTGGCCCA TTATCAACGA GGTCGTCGGA AACCCGTGGC GCAGCCTGCG CCGGTCTACC CGCAAACGCA TCTCAGCTTT CTCGCTAATG TCTACAACCA CAAAGCGCGG GAATTTTATC ACCGTTACGG CGTACAATTG ATTGATGCGG CCTATGAGGC GCATCAGGAG AAGGGCGAGG TACCGGTCAT GATCACCAAA CACTGCCTGC GTTTTGCGTT CAACCTTTGT CCAAAGCAGG CGAAAGGAAA TATTAAGAGC TGGAAAGCCA CGCCGATGCA GTTGGTGCAT GGCGATGAGG TACTGACGCT AAAATTCGAC TGCCGCCCTT GCGAAATGCA TGTCATTGGC AAAATTAAAA ACCACATCTT AAAAATGCCC CAGCCCGGCA GCGTTGTCGC TTCAGTGAGC CCTGAAGCGC TGATGAAAAC GCTGCCGAAG CGCAGGGGCG TTTAA
|
Protein sequence | MRQQPHYLEL LSPARDAAIA REAILHGADA VYIGGPGFGA RHNASNSLRD IADLVPFAHR YGARIFVTLN TILHDDELEP AQRLITDLYN TGVDALIVQD MGILELDIPP IELHASTQCD IRSVEKAKFL ADVGFSQIVL ARELNLSQIA AIHQATDATI EFFIHGALCV AYSGQCYISH AQTGRSANRG DCSQACRLPY TLKDDQGRVV SYEKHLLSMK DNDQTANLGA LIDAGVRSFK IEGRYKDMSY VKNITAHYRQ MLDAIIEQRG DLARASVGRT EHFFVPSTEK TFHRGSTDYF VNARKGDIGA FDSPKFIGLP VGEVLNVAKD YLDVEATEPL ANGDGLNVLI KREVVGFRAN TVEKTGHNRY RVWPNDMPAD LHKVRPHHPL NRNLDHNWQQ ALTKTSSERR VAVDIMLGGW QEQLILTLTS EDGVCITHTL DGVFEEANNA EKALNNLKAG LAKLGQTPYY ARDMQVTLPA ALFVPNSLLN QFRREAIDML DAARLAHYQR GRRKPVAQPA PVYPQTHLSF LANVYNHKAR EFYHRYGVQL IDAAYEAHQE KGEVPVMITK HCLRFAFNLC PKQAKGNIKS WKATPMQLVH GDEVLTLKFD CRPCEMHVIG KIKNHILKMP QPGSVVASVS PEALMKTLPK RRGV
|
| |