Gene SeD_A1612 details


Gene Information

Locus tag: SeD_A1612
Symbol:
ID: 6872404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1554476
End bp: 1555522
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 54%
IMG OID: 642784757
Product: putative periplasmic protease
Protein accession: YP_002215425
Protein GI: 198243766
COG category: [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
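
The gene and protein lengths in the record follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon (the function names are illustrative and not part of the database):

```python
# Coordinates copied from the record above.
START_BP = 1554476
END_BP = 1555522


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1047
print(protein_length(length_bp))  # 348
```

Both values agree with the Gene Length (1047 bp) and Protein Length (348 aa) fields.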


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.000000000117623
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGAATTGT TGTCTGAATA TGGCTTATTT TTGGCAAAAA TCGTCACCGT TGTGGTGGCC 
ATTGCCGTCA TTGTGCTGCT GATCGTGAAT GCTACGCAAC GCAAACGTCA GCGCGGCGAG
CTGCGCGTGA CCAATTTGAG CGAGCAGTAT CAGGAGATGA AGGATGACCT TGCTGCGGCG
TTGATGGATG GCCATCAGCA AAAACTGTGG CATAAAGCGC AGAAAAAAAA GCATAAGCAG
GAGGCGAAAG CCGCCAAAGC GAAAGCGAAG CTGGGGGACA TTGCGACATC GGACAAACCG
CGCGTATGGG TGATAGATTT CAAAGGCAGT ATGGACGCTC ACGAAGTTAA TGCGTTACGC
GAAGAGGTCA CGGCGGTGCT GGCAGTGGCG AAACCCGGCG ATCGGGCGGT TGTGCGTCTG
GAAAGCCCCG GTGGCGTTGT GCACGGCTAT GGCCTGGCGG CATCGCAATT GCAGCGCCTG
CGCGATAAAA ATATTCCGCT GACCGTGACG GTGGATAAAG TCGCGGCAAG CGGAGGCTAC
ATGATGGCCT GCGTGGCGGA AAAAATTATC GCGGCGCCGT TCGCTATTGT GGGGTCAATT
GGTGTTGTCG CGCAAATCCC GAACTTTAAC CGCTTTCTCA AAAGTAAAGA CATTGATATT
GAACTGCATA CCGCAGGGCA GTACAAACGT ACCCTGACTT TGTTAGGCGA GAATACGGAA
GAAGGGCGGC AGAAGTTTCG TGAAGATCTC AACGAAACGC ACCATCTGTT CAAAGAGTTT
GTGCAGCGGA TGCGTCCGGC TCTGGACATT GAACAGGTCG CCACGGGCGA ACACTGGTAC
GGTCAGCAGG CGCTGGAGAA GGGACTGGTT GATGAGATTA ACACCAGCGA TGAGGTTATC
CTCGGCCTGA TGGAAGGGCG CGAGGTGCTG AATGTGCGCT ATATGCAGCG TAAAAAACTG
ATCGATCGTG TTACCGGCAG CGCGGCGGAA AGCGCGGATC GGCTGCTGCT GCGCTGGTGG
CAGCGTGGAC AAAAGCCGTT GATGTAA
 
Protein sequence
MELLSEYGLF LAKIVTVVVA IAVIVLLIVN ATQRKRQRGE LRVTNLSEQY QEMKDDLAAA 
LMDGHQQKLW HKAQKKKHKQ EAKAAKAKAK LGDIATSDKP RVWVIDFKGS MDAHEVNALR
EEVTAVLAVA KPGDRAVVRL ESPGGVVHGY GLAASQLQRL RDKNIPLTVT VDKVAASGGY
MMACVAEKII AAPFAIVGSI GVVAQIPNFN RFLKSKDIDI ELHTAGQYKR TLTLLGENTE
EGRQKFREDL NETHHLFKEF VQRMRPALDI EQVATGEHWY GQQALEKGLV DEINTSDEVI
LGLMEGREVL NVRYMQRKKL IDRVTGSAAE SADRLLLRWW QRGQKPLM
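
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is one way to do it in plain Python, under these assumptions: the codon assignments are those of the standard genetic code (which table 11 shares), the listed alternative start codons are those NCBI gives for table 11, and an alternative start codon in the first position is read as M. The helper names (`clean`, `gc_percent`, `translate`) are illustrative; only short excerpts of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons for table 11 (assumption: as listed by NCBI).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon at position 0 as M."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = "M" if i == 0 and codon in TABLE11_STARTS else CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above (note the GTG start codon).
gene_head = clean("GTGGAATTGT TGTCTGAATA TGGCTTATTT")
print(translate(gene_head))  # MELLSEYGLF
```

Passing the full gene sequence through `clean` and then `translate` or `gc_percent` allows the same comparison against the complete protein sequence and the GC content field.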