Gene Information |
Locus tag | SNSL254_A2321 |
Symbol | |
ID | 6482158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2241094 |
End bp | 2242455 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642737664 |
Product | peptidase, U32 family |
Protein accession | YP_002041406 |
Protein GI | 194445636 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
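The coordinates and lengths listed above are related by simple arithmetic, which a short sketch can sanity-check. The helper names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of the record. The sketch assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(2241094, 2242455)
print(length, protein_length_aa(length))
```

With the values in this record the two helpers give 1362 bp and 453 aa, which agrees with the listed Gene Length and Protein Length.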
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAAC CAGAACTCCT TTCGCCGGCG GGAACGCTGA AAAATATGCG TTACGCTTTC GCTTACGGTG CCGATGCCGT CTATGCGGGC CAACCACGCT ACTCTTTACG CGTGCGTAAT AACGAATTCA ATCACGAAAA TTTGCAGCTT GGCATCAACG AAGCCCACGC GCTCGGAAAA AAATTCTACG TGGTGGTGAA CATCGCCCCG CATAACGCCA AGCTCAAAAC CTTTATCCGT GACCTGAAAC CCGTCGTCGA GATGGGCCCG GATGCGCTGA TCATGTCCGA TCCAGGGTTG ATTATGCTGG TACGCGAGCA CTTCCCGACA ATGCCGATTC ACCTGTCGGT ACAGGCTAAC GCCGTAAACT GGGCGACGGT AAAATTCTGG CAGCAGATGG GGCTGACCCG TGTGATTCTC TCCCGCGAAC TGTCGCTGGA AGAGATTGAG GAAATTCGCC AGCAGGTGCC GGATATGGAA ATAGAAATTT TCGTCCACGG CGCGCTATGC ATGGCCTATT CCGGCCGCTG CCTGCTTTCC GGCTACATCA ATAAACGCGA TCCGAATCAG GGCACCTGCA CCAATGCCTG CCGTTGGGAA TATAACGTGC AGGAAGGAAA AGAAGACGTT GTCGGCAACA TCGTGCATAA GCACGAACCG ATTCCGGTAC AGAACGTTGA GCCGACGCTC GGTATCGGCG CGCCGACGGA TAAAGTGTTT ATGATAGAAG AGGCCCAAAG ACCGGGCGAA TACATGACCG CGTTCGAAGA CGAGCATGGC ACCTATATCA TGAACTCAAA AGATTTGCGC GCTATCGCCC ACGTGGAGCG CCTGACGAAA ATGGGCGTCC ACTCGCTGAA AATCGAAGGC CGCACCAAAT CCTTTTATTA CTGCGCCCGT ACCGCGCAGG TCTACCGTAA GGCCATCGAC GACGCCGCCG CGGGTAAACC CTTCGACCCT ACGCTGCTGG AAACGTTGGA AGGTCTGGCT CATCGCGGCT ATACCGAAGG TTTCCTGCGT CGCCATACGC ACGACGATTA CCAGAATTAC GAGTACGGGT ACTCCGTTTC CGAACGCCAG CAATTTGTCG GCGAGTTCAC CGGCGAGCGT AAAGGCCAAC TGGCGGCCGT GACGGTGAAA AATAAATTCT CCGTTGGCGA TAGTCTGGAG CTGATGACAC CGCAGGGAAA TATCCATTTC ACCCTGGAAC AGATGGAGAA CGCCAAAGGC GACGCTATGC CGGTGGCACC TGGCGATGGC TATACCGTCT GGATGCCCGT CCCGCAGGAC GTTACGCTGG ATTACGCACT ATTGATGCGT AATTTCTCAG GCGAATCAAC GCGTAACCCC TATGCTAAGT AG
|
Protein sequence | MFKPELLSPA GTLKNMRYAF AYGADAVYAG QPRYSLRVRN NEFNHENLQL GINEAHALGK KFYVVVNIAP HNAKLKTFIR DLKPVVEMGP DALIMSDPGL IMLVREHFPT MPIHLSVQAN AVNWATVKFW QQMGLTRVIL SRELSLEEIE EIRQQVPDME IEIFVHGALC MAYSGRCLLS GYINKRDPNQ GTCTNACRWE YNVQEGKEDV VGNIVHKHEP IPVQNVEPTL GIGAPTDKVF MIEEAQRPGE YMTAFEDEHG TYIMNSKDLR AIAHVERLTK MGVHSLKIEG RTKSFYYCAR TAQVYRKAID DAAAGKPFDP TLLETLEGLA HRGYTEGFLR RHTHDDYQNY EYGYSVSERQ QFVGEFTGER KGQLAAVTVK NKFSVGDSLE LMTPQGNIHF TLEQMENAKG DAMPVAPGDG YTVWMPVPQD VTLDYALLMR NFSGESTRNP YAK
|
| |
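The sequences above are printed in space-separated blocks of ten, so the spaces must be removed before any computation. The sketch below shows one possible way to clean a sequence, compute its GC percentage, and translate it. The names `clean_sequence`, `gc_percent`, and `translate` are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares; the table 11 rule that reads alternative start codons as methionine is not modelled, which does not matter here because the gene starts with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the Gene sequence row, copied as printed.
prefix = "ATGTTTAAAC CAGAACTCCT TTCGCCGGCG"
print(translate(prefix))
```

The three blocks translate to MFKPELLSPA, the first ten residues of the Protein sequence row. Passing the full Gene sequence value to `gc_percent` and `translate` would allow the same kind of comparison against the GC content and Protein sequence rows.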