Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C1422 |
Symbol | sppA |
ID | 6490180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 1377955 |
End bp | 1379811 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642741654 |
Product | protease 4 |
Protein accession | YP_002045301 |
Protein GI | 194449756 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 0.671113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACCC TGTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG AGTGTTGAAC TTCGTCCGTG AAATGGTACT CAACCTGTTC TTTATTTTTC TGGTCCTGGT GGGCGTTGGG ATCTGGATGC AGATCGGTAA CGGCAGCAAC AGTGAACAAA CGGCGCGCGG CGCTCTGCTG CTGGATATTT CCGGCGTCAT TGTGGATAAA CCCTCCACCA ATCATCGTCT GGGCGCGCTG GGTCGCCAGT TATTTGGCGC CAGCTCCGAC CGTCTGCAGG AAAACTCCAT GTTTGACATC GTCAACGCTA TTCGTCAGGC GAAAGATGAC CGTAACATCA CGGGTATCGT TCTGGATCTG AAAAACTTCA CCGGTGCCGA TCAGCCGTCA ATGCGCTATA TCGGTAAAGC GCTGCGCGAA TTCCGCGATA GCGGCAAACC GGTTTTCGCC GTGGGTGAAA ACTACAGCCA GGGTCAGTAT TACCTCGCCA GTTTCGCCAA TAAAATTTGG CTTTCCCCAC AGGGTCAGGT AGATCTTCAC GGCTTCGCTA CGAATGGCCT GTACTACAAA ACGCTGCTGG ATAAACTGAA AGTCTCTACC CACGTTTTCC GGGTCGGCAC CTATAAATCC GCCGTCGAGC CGTTTATCCG CGACGATATG TCGCCCGCCG CCCGCGAGGC CGACAGCCGC TGGATAGGCG AACTGTGGCA GAACTACCTG CATACCGTTT CTGCCAATCG CCAGATTTCG CCGCAACAAC TGTTCCCCGG CGCGCAGGCT ATTATCGACG GGTTAACCAG CGTGGGCGGC GACACCGCCA AATATGCGCT CGACCATAAA CTGGTGGACG CCCTCGCCTC CAGCGCAGAT GTTGAAAAAG CGCTGACGAA GCAGTTTGGC TGGAGCAAAA CCGAAAATAA CTATCGCGCG ATCAGCTATT ACGATTATTC GCTGAAAACG CCTGCGGATA CCGGCGGTAC TATTGCGGTT ATTTTCGCCA ATGGCGCGAT TATGGATGGC GAAGAAACGC CAGGGAATGT CGGCGGCGAC ACTACGGCAT CGCAGATCCG CGACGCACGC CTTGATCCTA AAGTGAAAGC GATTGTGCTG CGCGTCAATA GCCCAGGCGG GAGCGTCAAC GCCTCCGAAG TTATCCGCGC CGAACTGGCG GCAGCAAGAG CGGCTGGCAA ACCGGTGGTG GTCTCAATGG GCGGCATGGC GGCCTCCGGC GGTTACTGGA TCTCTACGCC GGCAAACTAT ATCGTGGCCA GCCCCAGCAC GCTGACGGGT TCAATTGGCA TCTTCGGCGT CATCAATACG GTAGAAAACA GCCTGTCGTC GATTGGCGTA CACAGCGACG GCGTTTCCAC CTCGCCGCTG GCGGATATTT CGATGACCAA AGCGCTGTCA CCGGAAGTGC AGCAGATGAT GCAACTCAGT ATTGAGTACG GCTACAAACG CTTTATCACG CTGGTGGCAG ACGCGCGTAA GCGTACGCCG GAGCAGATTG ATAAAATCGC GCAAGGCCAT GTCTGGACCG GAGAAGACGC GAAAGCCAAT GGTCTGGTGG ACAGTCTCGG CGACTTTGAC GACGCCGTCG CCAAAGCGGC GGAGCTGGCG AAACTGAAAC AGTGGCATCT TGATTACTAT CAGGACGAAC CGACGGTCCT TGATATGGTC ATGGACAGTA TGACCGGATC AGTACGCGCC ATGCTGCCGG AGACCATTCA GGCGATGCTC CCGGCGCCGC TCGTTTCCGC CGCCAACACG GTGAAGGCCG AGGGGGATAA ACTGGCGGCA TTTAACGATC CGCAAAACCG TTATGCGTTC TGTTTGACTT GCGCGAACGT TCGCTAA
|
Protein sequence | MRTLWRFIAG FFKWTWRVLN FVREMVLNLF FIFLVLVGVG IWMQIGNGSN SEQTARGALL LDISGVIVDK PSTNHRLGAL GRQLFGASSD RLQENSMFDI VNAIRQAKDD RNITGIVLDL KNFTGADQPS MRYIGKALRE FRDSGKPVFA VGENYSQGQY YLASFANKIW LSPQGQVDLH GFATNGLYYK TLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL HTVSANRQIS PQQLFPGAQA IIDGLTSVGG DTAKYALDHK LVDALASSAD VEKALTKQFG WSKTENNYRA ISYYDYSLKT PADTGGTIAV IFANGAIMDG EETPGNVGGD TTASQIRDAR LDPKVKAIVL RVNSPGGSVN ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY IVASPSTLTG SIGIFGVINT VENSLSSIGV HSDGVSTSPL ADISMTKALS PEVQQMMQLS IEYGYKRFIT LVADARKRTP EQIDKIAQGH VWTGEDAKAN GLVDSLGDFD DAVAKAAELA KLKQWHLDYY QDEPTVLDMV MDSMTGSVRA MLPETIQAML PAPLVSAANT VKAEGDKLAA FNDPQNRYAF CLTCANVR
|
| |