Gene SeAg_B3141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B3141 
Symbol 
ID6795775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp3067075 
End bp3069963 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content52% 
IMG OID642777296 
Productprotease 3 
Protein accessionYP_002147903 
Protein GI197248364 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGCA GCACCTGGTT CAAAGCGTTA TTGTTATTGG TCGCCCTCTG GGGGCCTGCA 
GTTCAGGCGG ATATCGGTTG GCAACCGCTG CAAGAAACTA TCCGTAAAAG CGATAAAGAT
ACCCGGCAGT ATCAGGCGAT ACGTCTTGAT AATGACATGG TGGTGTTGCT GGTATCCGAT
CCGCAGGCTG TAAAGTCGCT TTCAGCGTTA GTGGTACCCG TTGGATCGCT TGAAGATCCT
GAGGCTCATC AGGGGCTTGC TCATTATCTT GAGCATATGT GCCTGATGGG GTCAAAAAAA
TATCCGCAGG CGGATAGCCT TGCCGAATAC CTTAAAAGAC ATGGCGGTAG CCACAACGCC
AGCACCGCGC CTTATCGAAC AGCCTTCTAC CTGGAGGTTG AAAACGACGC GCTTCCAGGC
GCCGTAGATC GCCTGGCCGA CGCCATTGCC GCGCCATTGC TCGATAAAAA GTATGCCGAA
CGCGAACGAA ACGCGGTGAA TGCCGAGCTG ACGATGGCGC GCACCCGCGA CGGTATGCGT
ATGGCGCAGG TAAGCGCCGA AACCATTAAC CCGGCGCATC CAGGCTCGCA CTTTTCTGGC
GGCAATCTGG AAACGCTAAG CGATAAGCCG GGCAATCCGG TGCAACAGGC GTTGATCGCT
TTTCATGAAA AATACTATTC ATCTAATCTG ATGAAGGCGG TGATTTACAG TAATAAACCC
TTGCCGGAAC TGGCGAGTAT TGCCGCCGCA ACCTATGGTC GCGTGCCGAA TAAACAGATT
AAAAAACCGG AAATTACCGT ACCTGTCATC ACCGAGGCGC AGAAGGGCAT CATTATTCAT
TACGTGCCGG CACTACCGCG TAAAGTGCTG CGCGTGGAGT TTCGTATTGA TAACAATAGC
GCGCAGTTCC GCAGCAAGAC CGATGAACTG GTCTCTTATC TGATTGGCAA TCGTAGCCCA
GGGACGCTCT CTGACTGGCT GCAAAAACAA GGACTGGTCG AAGGTATTAG CGCGGATTCC
GATCCGATTG TTAATGGTAA CAGCGGCGTA TTTGCCATTT CGGCAACGCT GACCGATAAA
GGTTTGGCCA ATCGTGATGA AGTCGTCGCG GCTATTTTCA GCTATCTCAA TATGTTACGT
GAAAAAGGGA TAGATAAACG TTACTTTGAC GAGCTGGCGC ATGTGCTGGA TCTCGACTTC
CGCTATCCGT CGATTACCCG CGATATGGAC TATGTCGAAT GGCTGGCGGA CACTATGATC
CGCGTCCCGG TAGCGCATAC GCTGGATGCG GCGAATATCG CCGATCGTTA CGATCCTGCC
GCTATCAAAA ATCGCCTGGC GATGATGACG CCGCAGAATG CGCGCATCTG GTACATCAGT
CCCCAGGAAC CCCATAATAA GACGGCGTAT TTTGTGGATG CGCCTTATCA GGTCGATAAG
ATTAGCGAAC AGACGTTTAA AAACTGGCAG CAAAAAGCAC AGGGCATTGC GTTGTCGCTG
CCGGAGTTAA ACCCCTATAT TCCTGATGAT TTTACGCTTG TTAAGAATGA CAAAAACTAC
GTGCGGCCAG AACTGATTGT CGATAAAGCG GATTTGCGCG TGGTTTATGC GCCGAGTCGT
TATTTCGCCA GCGAGCCAAA AGCTGACGTG AGCGTGGTAT TGCGTAACCC GCAGGCGATG
GACAGCGCCC GCAATCAGGT ACTGTTCGCC CTTAATGATT ATCTGGCGGG AATGGCGCTC
GATCAGCTCA GCAACCAGGC GGCAGTGGGC GGCATTAGCT TTTCCACTAA CGCCAACAAT
GGTCTGATGG TCACTGCGAA CGGTTATACC CAGCGTTTGC CGCAGCTTTT CCTGGCGTTG
CTGGAAGGAT ATTTTAGCTA CGACGCCACG GAGGAACAGC TGGCGCAGGC GAAATCCTGG
TATACGCAGA TGATGGATTC TGCGGAGAAG GGAAAAGCCT ACGAACAGGC GATTATGCCG
GTGCAGATGA TTTCGCAGGT GCCTTATTTT TCCCGTGATG AACGTCGCGC TTTGCTGCCG
TCCATTACGT TGAAAGAGGT GATGGCCTAT CGTAATGCGT TAAAAACGGG CGCTCGTCCG
GAATTTCTGG TTATAGGGAA TATGAGCGAA GCCCAGGCGA CCTCTCTGGC GCAAGATGTT
CAAAAACAGC TCGCGGCGAA CGGATCGGCG TGGTGTCGCA ACAAAGACGT AGTGGTTGAG
AAAAAGCAGT CCGTAATATT TGAAAAAGCG GGCAGTAGCA CCGACTCCGC GTTAGCCGCG
GTCTTTGTCC CGGTCGGCTA CGACGAGTAC GTCAGCGCCG CCTACAGCGC GATGTTAGGT
CAGATTGTTC AACCGTGGTT TTACAATCAG CTACGAACCG AAGAGCAACT GGGATACGCC
GTTTTCGCCT TTCCGATGAG CGTTGGCCGT CAGTGGGGAA TGGGATTCCT GCTACAGAGC
AACGATAAAC AGCCCTCTTA CCTGTGGCAA CGCTATCAGG CATTTTTCCC TGACGCCGAG
GCGAAGCTGA GGGCGATGAA GCCGGAAGAG TTCGCCCAAA TTCAGCAGGC GATCATTACG
CAAATGCGCC AGGCGCCGCA AACGTTGGGC GAAGAAGCAT CCCGTTTAAG CAAGGATTTC
GATCGGGGTA ATATGCGCTT TGACTCGCGT GATAAAATCA TCGCTCAGAT AAAATTGCTG
ACGCCACAAA AGCTTGCCGA CTTCTTCCAC CAGGCGGTGG TGGAACCACA AGGTATGGCA
ATATTGTCAC AGATTGCTGG TAGCCAGAAT GGAAAAGCAG AATACGTGCA TCCGACAGGC
TGGAAAGTGT GGGATAACGT CAGCGCATTG CAGCAAACGT TACCTCTAAT GAGCGAAAAG
AATGAATGA
 
Protein sequence
MPRSTWFKAL LLLVALWGPA VQADIGWQPL QETIRKSDKD TRQYQAIRLD NDMVVLLVSD 
PQAVKSLSAL VVPVGSLEDP EAHQGLAHYL EHMCLMGSKK YPQADSLAEY LKRHGGSHNA
STAPYRTAFY LEVENDALPG AVDRLADAIA APLLDKKYAE RERNAVNAEL TMARTRDGMR
MAQVSAETIN PAHPGSHFSG GNLETLSDKP GNPVQQALIA FHEKYYSSNL MKAVIYSNKP
LPELASIAAA TYGRVPNKQI KKPEITVPVI TEAQKGIIIH YVPALPRKVL RVEFRIDNNS
AQFRSKTDEL VSYLIGNRSP GTLSDWLQKQ GLVEGISADS DPIVNGNSGV FAISATLTDK
GLANRDEVVA AIFSYLNMLR EKGIDKRYFD ELAHVLDLDF RYPSITRDMD YVEWLADTMI
RVPVAHTLDA ANIADRYDPA AIKNRLAMMT PQNARIWYIS PQEPHNKTAY FVDAPYQVDK
ISEQTFKNWQ QKAQGIALSL PELNPYIPDD FTLVKNDKNY VRPELIVDKA DLRVVYAPSR
YFASEPKADV SVVLRNPQAM DSARNQVLFA LNDYLAGMAL DQLSNQAAVG GISFSTNANN
GLMVTANGYT QRLPQLFLAL LEGYFSYDAT EEQLAQAKSW YTQMMDSAEK GKAYEQAIMP
VQMISQVPYF SRDERRALLP SITLKEVMAY RNALKTGARP EFLVIGNMSE AQATSLAQDV
QKQLAANGSA WCRNKDVVVE KKQSVIFEKA GSSTDSALAA VFVPVGYDEY VSAAYSAMLG
QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSYLWQ RYQAFFPDAE
AKLRAMKPEE FAQIQQAIIT QMRQAPQTLG EEASRLSKDF DRGNMRFDSR DKIIAQIKLL
TPQKLADFFH QAVVEPQGMA ILSQIAGSQN GKAEYVHPTG WKVWDNVSAL QQTLPLMSEK
NE