Gene SeSA_A2372 details

Gene Information

Locus tag: SeSA_A2372
Symbol:
ID: 6518198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 2251228
End bp: 2252589
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 55%
IMG OID: 642747433
Product: peptidase, U32 family
Protein accession: YP_002115226
Protein GI: 194735950
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
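
The length fields in the record are consistent with its coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene span includes the stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(2251228, 2252589)
print(length_bp)                  # 1362
print(protein_length(length_bp))  # 453
```

Both results match the Gene Length and Protein Length fields listed in the record.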


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.444548
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAAAC CAGAACTCCT TTCGCCGGCG GGAACGCTGA AAAATATGCG TTACGCTTTC 
GCTTACGGTG CCGATGCCGT CTATGCGGGC CAACCGCGCT ACTCTTTACG CGTGCGTAAT
AACGAATTCA ATCACGAAAA TTTGCAGCTT GGCATCAACG AAGCCCACGC GCTCGGAAAA
AAATTCTACG TGGTGGTGAA CATCGCCCCG CATAATGCCA AGCTCAAAAC CTTTATCCGT
GACCTGAAAC CCGTCGTCGA GATGGGCCCG GATGCGCTGA TCATGTCCGA TCCAGGGTTG
ATTATGCTGG TACGCGAGCA CTTCCCGGCA ATGCCGATTC ACCTGTCGGT ACAGGCTAAT
GCCGTAAACT GGGCGACGGT AAAATTCTGG CAGCAGATGG GGCTGACCCG TGTGATTCTC
TCCCGCGAGC TGTCGCTGGA AGAGATTGAG GAAATTCGCC AGCAGGTGCC GGATATGGAA
ATAGAAATTT TCGTCCACGG CGCGCTATGC ATGGCCTATT CCGGCCGCTG CCTGCTTTCC
GGCTACATCA ATAAACGCGA TCCGAATCAG GGCACCTGCA CCAATGCCTG CCGTTGGGAA
TATAACGTGC AGGAAGGAAA AGAAGACGTT GTCGGCAACA TCGTGCATAA GCACGAACCG
ATTCCGGTAC AGAACGTTGA GCCGACGCTC GGTATCGGCG CGCCGACGGA TAAAGTGTTT
ATGATAGAAG AGGCCCAAAG ACCGGGCGAA TACATGACCG CGTTCGAAGA CGAGCATGGC
ACCTATATCA TGAACTCAAA AGATTTGCGC GCTATCGCCC ACGTGGAGCG CCTGACGAAA
ATGGGCGTCC ACTCGCTGAA AATCGAAGGC CGCACCAAAT CCTTTTATTA CTGCGCCCGT
ACCGCGCAGG TCTACCGTAA GGCCATCGAC GACGCCGCCG CGGGTAAACC CTTCGACCCT
ACGCTGCTGG AAACGTTGGA AGGTCTGGCT CATCGCGGCT ATACCGAAGG TTTCCTGCGT
CGCCATACGC ACGACGATTA CCAGAATTAC GAGTACGGGT ACTCCGTTTC CGAACGCCAG
CAATTTGTCG GCGAGTTCAC CGGCGAGCGT AAAGGCCAAC TGGCGGCCGT GGCGGTGAAA
AATAAATTCT CCGTTGGCGA TAGTCTGGAG CTGATGACAC CGCAGGGAAA TATCAATTTC
ACCCTGGAAC AGATGGAGAA CGCCAAAGGC GACGCTATGC CGGTGGCGCC TGGCGATGGC
TATACCGTCT GGATGCCCGT CCCGCAGGAC GTTACGCTGG ATTACGCACT ATTGATGCGT
AATTTCTCAG GCGAATCAAC GCGTAACCCC CATGCCAAGT AG
 
Protein sequence
MFKPELLSPA GTLKNMRYAF AYGADAVYAG QPRYSLRVRN NEFNHENLQL GINEAHALGK 
KFYVVVNIAP HNAKLKTFIR DLKPVVEMGP DALIMSDPGL IMLVREHFPA MPIHLSVQAN
AVNWATVKFW QQMGLTRVIL SRELSLEEIE EIRQQVPDME IEIFVHGALC MAYSGRCLLS
GYINKRDPNQ GTCTNACRWE YNVQEGKEDV VGNIVHKHEP IPVQNVEPTL GIGAPTDKVF
MIEEAQRPGE YMTAFEDEHG TYIMNSKDLR AIAHVERLTK MGVHSLKIEG RTKSFYYCAR
TAQVYRKAID DAAAGKPFDP TLLETLEGLA HRGYTEGFLR RHTHDDYQNY EYGYSVSERQ
QFVGEFTGER KGQLAAVAVK NKFSVGDSLE LMTPQGNINF TLEQMENAKG DAMPVAPGDG
YTVWMPVPQD VTLDYALLMR NFSGESTRNP HAK
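
The two sequences above can be checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it strips the 10-base grouping and line breaks, translates with the standard codon assignments (which translation table 11 shares for elongation codons), and computes the G+C fraction. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and alternative start codons are not handled, which is not an issue here because the gene begins with ATG.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks used for display grouping; uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First ten codons of the gene sequence listed above.
print(translate("ATGTTTAAAC CAGAACTCCT TTCGCCGGCG"))  # MFKPELLSPA
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the listed protein sequence and the GC content field.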