Gene SNSL254_A1843 details


Gene Information

Locus tag: SNSL254_A1843
Symbol:
ID: 6486475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1809434
End bp: 1810480
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 54%
IMG OID: 642737218
Product: putative periplasmic protease
Protein accession: YP_002040970
Protein GI: 194444766
COG category: [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
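
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: codons minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates copied from the record above.
length = gene_length(1809434, 1810480)
print(length, protein_length(length))  # 1047 348
```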


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.0000005383
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGAATTGT TGTCTGAATA TGGCTTATTT TTGGCAAAAA TCGTCACCGT TGTGGTGGCC 
ATTGCCGTCA TTGTGCTGCT GATCGTGAAT GCTACGCAAC GCAAACGTCA GCGCGGCGAG
CTGCGCGTGA CCAATTTGAG TGAGCAGTAT CAGGAGATGA AGGATGACCT TGCTGCGGCG
TTGATGGATG GCCATCAGCA AAAACTGTGG CATAAAGCGC AGAAAAAAAA GCATAAGCAG
GAGGCGAAAG CCGCCAAAGC GAAAGCGAAG CTGGGGGACA TTGCGACATC GGACAAACCG
CGCGTATGGG TGATAGATTT CAAAGGCAGT ATGGACGCTC ACGAAGTTAA TGCGTTACGC
GAAGAGGTCA CGGCGGTGCT GGCAGTGGTG AAACCCGGCG ATCGGGCGGT TGTGCGTCTG
GAAAGCCCCG GTGGCGTTGT GCACGGCTAT GGCCTGGCGG CATCGCAATT GCAGCGCCTG
CGCGATAAAA ATATTCCGCT GACCGTGACG GTGGATAAAG TCGCGGCAAG CGGAGGCTAC
ATGATGGCCT GCGTGGCGGA AAAAATTATC GCGGCGCCGT TTGCTATTGT GGGGTCAATT
GGTGTTGTCG CGCAAATCCC GAACTTTAAC CGCTTTCTCA AAAGTAAAGA CATTGATATT
GAACTGCATA CCGCAGGGCA GTACAAACGT ACCCTGACTT TGTTAGGCGA GAATACGGAA
GAAGGGCGGC AGAAGTTTCG TGAAGATCTC AACGAAACGC ACCATCTGTT CAAAGAGTTT
GTGCAGCGGA TGCGTCCGGC TCTGGACATT GAACAGGTCG CCACGGGCGA ACACTGGTAC
GGTCAGCAGG CGCTGGAGAA AGGACTGGTT GATGAGATTA ACACCAGCGA TGAGGTTATC
CTCGGCCTGA TGGAAGGGCG TGAGGTGCTG AATGTGCGCT ATATGCAGCG TAAAAAACTG
ATCGATCGTG TTACCGGCAG CGCGGCGGAA AGCGCGGATC GGCTACTGCT GCGCTGGTGG
CAGCGTGGAC AAAAGCCATT GATGTAA
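
The GC content reported in the record can be recomputed from the gene sequence. The sketch below is one way to do that for text laid out in space-separated 10-base blocks, as it is here. The function name `gc_percent` is illustrative, and how the database rounds the figure is not stated in the record.

```python
def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring spaces and line breaks."""
    seq = "".join(raw.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence: four G, no C.
print(gc_percent("GTGGAATTGT"))  # 40.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value for the whole gene.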
 
Protein sequence
MELLSEYGLF LAKIVTVVVA IAVIVLLIVN ATQRKRQRGE LRVTNLSEQY QEMKDDLAAA 
LMDGHQQKLW HKAQKKKHKQ EAKAAKAKAK LGDIATSDKP RVWVIDFKGS MDAHEVNALR
EEVTAVLAVV KPGDRAVVRL ESPGGVVHGY GLAASQLQRL RDKNIPLTVT VDKVAASGGY
MMACVAEKII AAPFAIVGSI GVVAQIPNFN RFLKSKDIDI ELHTAGQYKR TLTLLGENTE
EGRQKFREDL NETHHLFKEF VQRMRPALDI EQVATGEHWY GQQALEKGLV DEINTSDEVI
LGLMEGREVL NVRYMQRKKL IDRVTGSAAE SADRLLLRWW QRGQKPLM
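
The protein sequence starts with M although the gene starts with GTG, which normally encodes valine. This fits translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the first position. The sketch below shows that translation step. The table is built from the standard amino acid assignments, which table 11 shares, and the start-codon set is the one listed for table 11. The function name `translate_cds` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is translated as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGAATTGT TGTCTGAATA TGGCTTATTT"))  # MELLSEYGLF
```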