Gene SeAg_B1557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B1557 
Symbol 
ID6793499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp1516275 
End bp1518239 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content53% 
IMG OID642775796 
Productpeptidase, U32 family 
Protein accessionYP_002146432 
Protein GI197248351 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCAGC AACCTCACTA TCTCGAATTG TTAAGTCCGG CCCGTGACGC CGCAATTGCT 
CGCGAAGCGA TTTTGCATGG CGCAGATGCC GTCTACATCG GCGGACCCGG TTTTGGCGCA
CGTCATAACG CCAGTAACAG TCTGCGCGAT ATCGCCGATC TGGTCCCGTT TGCTCACCGT
TACGGCGCCA GGATTTTTGT CACGCTGAAT ACTATCCTGC ATGATGATGA GCTGGAGCCC
GCGCAGCGGT TAATCACCGA TTTGTACAAC ACCGGGGTGG ATGCGCTGAT TGTGCAGGAT
ATGGGCATTC TGGAACTGGA TATCCCGCCG ATTGAGCTTC ACGCCAGTAC ACAGTGTGAT
ATTCGCAGCG TGGAAAAAGC GAAGTTTCTT GCCGATGTCG GTTTTTCACA GATTGTACTG
GCGCGCGAGC TTAATTTGAG TCAGATAGCG GCTATTCATC AGGCTACTGA CGCCACGATT
GAGTTCTTCA TTCATGGCGC GCTGTGTGTC GCTTATTCTG GGCAGTGTTA TATCTCTCAT
GCGCAAACCG GGCGCAGCGC CAATCGGGGC GACTGTTCGC AGGCCTGTCG TTTACCGTAT
ACGTTAAAAG ACGATCAGGG GCGGGTGGTC TCTTACGAAA AACATTTGCT ATCGATGAAA
GATAACGACC AAACGGCTAA CCTCGGCGCG TTGATCGATG CAGGCGTACG TTCCTTCAAG
ATTGAAGGGC GCTACAAAGA CATGAGCTAT GTCAAAAACA TCACCGCGCA TTATCGTCAG
ATGCTGGACG CGATTATCGA GCAACGTGGC GATCTGGCGC GTGCATCGGT TGGTCGGACC
GAGCATTTTT TTGTTCCCTC CACGGAGAAA ACCTTCCATC GCGGCAGCAC CGACTATTTT
GTTAACGCGC GTAAAGGTGA TATTGGCGCA TTTGATTCAC CAAAATTTAT TGGCTTGCCG
GTAGGCGAGG TGCTGAATGT GGCGAAGGAT TATCTCGACG TAGAAGCTAC GGAGCCGTTG
GCGAATGGCG ATGGTCTGAA CGTGTTGATT AAGCGTGAAG TGGTGGGTTT TCGCGCCAAT
ACGGTGGAGA AAACCGGTCA TAACCGCTAC CGCGTTTGGC CAAATGATAT GCCTGCCGAC
CTGCATAAAG TCCGTCCGCA TCATCCGTTG AATCGTAATC TGGATCATAA CTGGCAGCAA
GCGCTGACAA AAACCTCCAG TGAGCGCCGT GTGGCGGTTG ATATCATGCT GGGCGGCTGG
CAGGAACAGC TTATTCTGAC GCTGACCAGT GAAGACGGTG TCTGCATCAC GCATACGCTT
GACGGGGTAT TTGAGGAAGC CAATAACTCT GAAAAAGCGT TGAATAACCT AAAAGCCGGA
CTGGCGAAGC TGGGACAGAC GCCTTACTAC GCGCGTGATA TGCAGGTGAC ATTACCGGCG
GCGTTGTTCG TGCCAAATAG CCTGCTCAAT CAGTTCCGTC GGGAGGCGAT TGATATGCTT
GACGCGGCGC GGCTGGCCCA TTATCAACGA GGTCGTCGGA AACCCGTGGC GCAGCCTGCG
CCGGTCTATC CGCAAACGCA TCTCAGCTTT CTCGCTAATG TCTACAACCA CAAAGCGCGG
GAATTTTATC ACCGTTACGG CGTACAATTG ATTGATGCGG CCTATGAGGC GCATCAGGAG
AAGGGCGAGG TACCGGTCAT GATCACCAAA CACTGCCTGC GTTTTGCGTT CAACCTTTGT
CCTAAGCAGG CGAAAGGAAA TATTAAGAGC TGGAAAGCCA CGCCGATGCA GTTGGTGCAT
GGCGATGAGG TACTGACGCT AAAATTCGAC TGCCGCCCTT GCGAAATGCA TGTCATTGGC
AAAATTAAAA ACCACCTCTT AAAAATGCCC CAGCCCGGCA GCGTTGTCGC TTCAGTGAGC
CCTGAAGCGC TGATGAAAAC GCTGCCGAAG CGCAGGGGCG TTTAA
 
Protein sequence
MRQQPHYLEL LSPARDAAIA REAILHGADA VYIGGPGFGA RHNASNSLRD IADLVPFAHR 
YGARIFVTLN TILHDDELEP AQRLITDLYN TGVDALIVQD MGILELDIPP IELHASTQCD
IRSVEKAKFL ADVGFSQIVL ARELNLSQIA AIHQATDATI EFFIHGALCV AYSGQCYISH
AQTGRSANRG DCSQACRLPY TLKDDQGRVV SYEKHLLSMK DNDQTANLGA LIDAGVRSFK
IEGRYKDMSY VKNITAHYRQ MLDAIIEQRG DLARASVGRT EHFFVPSTEK TFHRGSTDYF
VNARKGDIGA FDSPKFIGLP VGEVLNVAKD YLDVEATEPL ANGDGLNVLI KREVVGFRAN
TVEKTGHNRY RVWPNDMPAD LHKVRPHHPL NRNLDHNWQQ ALTKTSSERR VAVDIMLGGW
QEQLILTLTS EDGVCITHTL DGVFEEANNS EKALNNLKAG LAKLGQTPYY ARDMQVTLPA
ALFVPNSLLN QFRREAIDML DAARLAHYQR GRRKPVAQPA PVYPQTHLSF LANVYNHKAR
EFYHRYGVQL IDAAYEAHQE KGEVPVMITK HCLRFAFNLC PKQAKGNIKS WKATPMQLVH
GDEVLTLKFD CRPCEMHVIG KIKNHLLKMP QPGSVVASVS PEALMKTLPK RRGV