Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A1738 |
Symbol | |
ID | 6874029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1672941 |
End bp | 1674905 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642784875 |
Product | peptidase, U32 family |
Protein accession | YP_002215543 |
Protein GI | 198244510 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCAGC AACCTCACTA TCTCGAATTG TTAAGTCCGG CCCGTGACGC CGCAATTGCT CGCGAAGCGA TTTTGCATGG CGCAGATGCT GTCTACATCG GCGGACCCGG TTTTGGCGCA CGTCATAACG CCAGTAACAG TCTGCGCGAT ATCGCCGATC TGGTTCCGTT TGCTCACCGT TACGGCGCCA GGATTTTTGT CACGCTGAAT ACTATTCTGC ATGATGATGA GCTGGAGCCC GCGCAGCGGT TAATCACCGA TTTGTACAAC ACCGGGGTGG ATGCGCTGAT TGTGCAGGAT ATGGGCATTC TGGAACTGGA TATCCCGCCG ATTGAGCTTC ACGCCAGTAC ACAGTGTGAT ATTCGCAGCG TGGAAAAAGC GAAGTTTCTT GCCGATGTCG GTTTTTCACA GATTGTACTG GCGCGCGAGC TTAATTTGAG TCAGATAGCG GCTATTCATC AGGCTACTGA CGCCACGATT GAGTTCTTCA TTCATGGCGC GCTGTGTGTC GCTTATTCTG GGCAGTGTTA TATCTCTCAT GCACAAACCG GGCGCAGCGC CAATCGGGGC GACTGTTCGC AAGCCTGTCG CTTACCGTAT ACGTTAAAAG ACGATCAGGG GCGGGTGGTC TCTTACGAAA AACATTTGCT ATCGATGAAA GATAACGACC AAACGGCTAA CCTCGGCGCG TTGATCGATG CAGGCGTACG TTCCTTCAAG ATTGAAGGGC GCTATAAAGA CATGAGCTAT GTCAAAAACA TCACCGCGCA TTATCGTCAG ATGCTGGACG CGATTATCGA GCAACGTGGC GATCTGGCGC GTGCATCGGT TGGTCGGACC GAGCATTTTT TTGTTCCCTC CACGGAAAAA ACCTTCCATC GCGGCAGCAC CGACTATTTT GTTAACGCGC GTAAAGGTGA TATTGGCGCA TTTGATTCAC CAAAATTTAT TGGCTTGCCG GTAGGCGAAG TGCTGAATGT GGCGAAGGAT TATCTCGATG TAGAAGCGAC GGAGCCGTTG GCGAATGGCG ATGGTCTGAA CGTGTTGATT AAGCGTGAAG TGGTAGGTTT TCGCGCCAAT ACGGTGGAGA AAACCGGTCA TAACCGCTAC CGCGTTTGGC CAAATGATAT GCCTGCCGAC CTGCATAAAG TCCGTCCGCA TCATCCGTTG AATCGTAATC TGGATCATAA CTGGCAGCAA GCGCTGACAA AAACCTCCAG TGAGCGCCGT GTGGCGGTTG ATATCATGCT GGGCGGCTGG CAGGAACAGC TTATTCTGAC GCTGACCAGT GAAGACGGTG TCTGCATCAC GCATACGCTT GACGGGGTAT TTGAGGAAGC CAACAACTCT GAAAAAGCGT TGAATAACCT AAAAGCCGGA CTGGCGAAGC TGGGACAGAC GCCTTACTAC GTGCGTGATA TGCAGGTGAC ATTACCGGCG GCGTTGTTCG TGCCAAATAG CCTGCTCAAT CAGTTCCGTC GGGAGGCGAT TGATATGCTT GACGCGGCGC GGCTGGCCCA TTATCAACGA GGTCGTCGGA AACCCGTGGC GCAGCCTGCG CCGGTCTACC CGCAAACGCA TCTCAGCTTT CTCGCTAATG TCTACAACCA CAAAGCGCGG GAATTTTATC ACCGTTACGG CGTACAATTG ATTGATGCGG CCTATGAGGC GCATCAGGAG AAGGGCGAGG TACCGGTCAT GATCACCAAA CACTGCCTGC GTTTTGCGTT CAACCTTTGT CCAAAGCAGG CGAAAGGAAA TATTAAGAGC TGGAAAGCCA CGCCGATGCA GTTGGTGCAT GGCGATGAGG TACTGACGCT AAAATTCGAC TGCCGCCCTT GCGAAATGCA TGTCATTGGC AAAATTAAAA ACCACATCTT AAAAATGCCC CAGCCCGGCA GCGTTGTCGC TTCAGTGAGC CCTGAAGCGC TGATGAAAAC GCTGCCGAAG CGCAGGGGCG TTTAA
|
Protein sequence | MRQQPHYLEL LSPARDAAIA REAILHGADA VYIGGPGFGA RHNASNSLRD IADLVPFAHR YGARIFVTLN TILHDDELEP AQRLITDLYN TGVDALIVQD MGILELDIPP IELHASTQCD IRSVEKAKFL ADVGFSQIVL ARELNLSQIA AIHQATDATI EFFIHGALCV AYSGQCYISH AQTGRSANRG DCSQACRLPY TLKDDQGRVV SYEKHLLSMK DNDQTANLGA LIDAGVRSFK IEGRYKDMSY VKNITAHYRQ MLDAIIEQRG DLARASVGRT EHFFVPSTEK TFHRGSTDYF VNARKGDIGA FDSPKFIGLP VGEVLNVAKD YLDVEATEPL ANGDGLNVLI KREVVGFRAN TVEKTGHNRY RVWPNDMPAD LHKVRPHHPL NRNLDHNWQQ ALTKTSSERR VAVDIMLGGW QEQLILTLTS EDGVCITHTL DGVFEEANNS EKALNNLKAG LAKLGQTPYY VRDMQVTLPA ALFVPNSLLN QFRREAIDML DAARLAHYQR GRRKPVAQPA PVYPQTHLSF LANVYNHKAR EFYHRYGVQL IDAAYEAHQE KGEVPVMITK HCLRFAFNLC PKQAKGNIKS WKATPMQLVH GDEVLTLKFD CRPCEMHVIG KIKNHILKMP QPGSVVASVS PEALMKTLPK RRGV
|
| |