Gene SeD_A3321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3321 
Symbol 
ID6871413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3197393 
End bp3200281 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content52% 
IMG OID642786328 
Productprotease 3 
Protein accessionYP_002216967 
Protein GI198242678 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.328834 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCA GCACCTGGTT CAAAGCGTTA TTGTTATTGG TCGCCCTCTG GGGGCCTGCA 
GTTCAGGCGG ATATCGGTTG GCAACCGCTG CAAGAAACTA TCCGTAAAAG CGATAAAGAT
ACCCGGCAGT ATCAGGCGAT ACGTCTTGAT AATGACATGG TGGTGTTGCT CGTATCCGAT
CCGCAGGCTG TAAAGTCGCT TTCAGCGTTA GTGGTGCCCG TTGGATCGCT TGAAGATCCT
GAGGCTCATC AGGGGCTTGC TCATTATCTT GAGCATATGT GCCTGATGGG GTCAAAAAAA
TATCCGCAGG CGGATAGCCT TGCCGAATAC CTTAAAAGAC ATGGCGGTAG CCACAACGCC
AGCACCGCGC CTTATCGAAC AGCCTTCTAC CTGGAGGTTG AAAACGACGC GCTTCCAGGT
GCCGTAGATC GACTGGCCGA CGCCATTGCC GCGCCATTGC TCAATAAAAA GTATGCCGAA
CGCGAACGAA ACGCGGTGAA TGCCGAGCTG ACGATGGCGC GCACCCGCGA CGGTATGCGT
ATGGCGCAGG TAAGCGCCGA AACCATTAAC CCGGCGCATC CAGGCTCGCA CTTTTCTGGC
GGCAATCTGG AAACGCTAAG CGATAAGCCG GGAAATCCGG TGCAACAGGC GTTGATCGCT
TTTCATGAAA AATACTATTC ATCTAATCTG ATGAAGGCGG TGATTTACAG TAATAAACCC
TTGCCGGAAC TGGCGAGTAT TGCCGCCGCA ACCTATGGTC GCGTGCCGAA TAAACAGATT
AAAAAACCGG AAATTACCGT ACCTGTCATC ACCGAGGCGC AGAAGGGCAT CATTATTCAT
TACGTGCCGG CGCTACCGCG TAAAGTGCTG CGCGTGGAGT TTCGTATTGA TAACAATAGC
GCGCAGTTCC GTAGCAAGAC CGATGAACTG GTCTCTTATC TGATTGGCAA TCGTAGCCCG
GGGACGCTCT CTGACTGGCT GCAAAAACAA GGACTGGTCG AAGGTATTAG CGCGGATTCC
GATCCGATTG TTAATGGTAA CAGCGGCGTA TTTGCCATTT CGGCAACGCT GACTGATAAA
GGTTTGGCCA ATCGTGATGA AGTCGTCGCA GCTATCTTTA GCTATCTCAA TACGTTACGT
GAAAAAGGGA TAGATAAACG TTACTTTGAC GAGCTGGCGC ATGTGCTGGA TCTCGACTTC
CGCTATCCGT CGATTACCCG CGATATGGAC TATGTCGAGT GGCTGGCGGA CACTATGATC
CGCGTCCCGG TAGCGCATAC GCTGGATGCG GCGAATATCG CCGATCGTTA CGATCCTGCC
GCTATCAAAA ATCGCCTGGC GATGATGACG CCGCAGAATG CGCGCATCTG GTACATCAGT
CCCCAGGAAC CCCATAATAA GATTGCGTAT TTTGTGGATG CGCCTTATCA GGTCGATAAG
ATTAGCGAAC AGACGTTTAA AAACTGGCAG CAAAAAGCAC AGGGCATTGC GTTGTCGCTG
CCGGAGTTAA ACCCCTATAT TCCTGATGAT TTTTCGCTTG TTAAGAATGA CAAAAACTAC
GTGCGGCCAG AACTGATTGT CGATAAAGCG GATTTGCGCG TGGTTTATGC GCCGAGTCGT
TATTTCGCCA GCGAGCCAAA AGCTGACGTG AGCGTGGTAT TGCGTAACCC GCAGGCGATG
GACAGCGCCC GCAATCAGGT ACTGTTCGCC CTTAATGATT ATCTGGCGGG AATGGCGCTC
GATCAGCTCA GCAACCAGGC GGCAGTGGGC GGCATTAGCT TTTCCACTAA CGCCAACAAT
GGTCTGATGG TCACTGCGAA CGGTTATACC CAGCGTTTGC CGCAGCTTTT CCTGGCGCTG
CTGGAAGGAT ATTTTAGCTA CGACGCCACG GAGGAACAGC TGGCGCAGGC GAAATCCTGG
TATACGCAGA TGATGGATTC TGCGGAGAAG GGAAAAGCCT ACGAACAGGC GATTATGCCG
GTGCAGATGA TTTCGCAGGT GCCTTATTTT TCCCGTGATG AACGTCGCGC TTTGCTGCCG
TCCATTACGT TAAAAGAGGT GATGGCCTAT CGTAATGCGT TAAAAACGGG CGTTCGTCCG
GAATTTCTGG TTATAGGGAA TATGAGCGAA GCCCAGGCGA CCTCTCTGGC GCAAGATGTC
CAAAAACAGC TCGCGGCGAA CGGATCGGCG TGGTGTCGTA ACAAAGACGT GGTGGTTGAG
AAAAAGCAGT CCGTAATATT TGAAAAAGCG GGCAGTAGCA CCGACTCCGC GTTAGCAGCG
GTCTTTGTCC CGGTCGGCTA CGACGAGTAC GTCAGCGCCG CCTACAGCGC GATGTTAGGT
CAGATTGTTC AACCGTGGTT TTACAATCAG CTACGAACCG AAGAGCAACT GGGATACGCT
GTTTTCGCCT TTCCGATGAG CGTTGGCCGT CAGTGGGGAA TGGGATTCCT GCTACAGAGC
AACGATAAAC AGCCCTCTTA CCTGTGGCAA CGCTATCAGG CATTTTTCCC TGACGCCGAG
GCGAAGCTGA GGGCGATGAA GCCGGAAGAG TTCGCCCAAA TTCAGCAGGC GATCATTACG
CAAATGCGCC AGGCGCCGCA AACGTTGGGC GAAGAAGCAT CCCGTTTAAG CAAGGATTTC
GATCGGGGTA ATATGCGCTT TGACTCGCGT GATAAAATCA TCGCTCAGAT AAAATTGCTG
ACGCCACAAA AGCTTGCCGA TTTCTTCCAC CAGGCGGTGG TGGAACCACA AGGTATGGCA
ATATTGTCAC AGATTGCCGG TAGCCAGAAT GGAAAAGCAG AATACGTGCA TCCGACAGGC
TGGAAAGTGT GGGATAACGT CAGCGCTTTG CAGCAAACGT TACCTCTAAT GAGCGAAAAG
AATGAATGA
 
Protein sequence
MPRSTWFKAL LLLVALWGPA VQADIGWQPL QETIRKSDKD TRQYQAIRLD NDMVVLLVSD 
PQAVKSLSAL VVPVGSLEDP EAHQGLAHYL EHMCLMGSKK YPQADSLAEY LKRHGGSHNA
STAPYRTAFY LEVENDALPG AVDRLADAIA APLLNKKYAE RERNAVNAEL TMARTRDGMR
MAQVSAETIN PAHPGSHFSG GNLETLSDKP GNPVQQALIA FHEKYYSSNL MKAVIYSNKP
LPELASIAAA TYGRVPNKQI KKPEITVPVI TEAQKGIIIH YVPALPRKVL RVEFRIDNNS
AQFRSKTDEL VSYLIGNRSP GTLSDWLQKQ GLVEGISADS DPIVNGNSGV FAISATLTDK
GLANRDEVVA AIFSYLNTLR EKGIDKRYFD ELAHVLDLDF RYPSITRDMD YVEWLADTMI
RVPVAHTLDA ANIADRYDPA AIKNRLAMMT PQNARIWYIS PQEPHNKIAY FVDAPYQVDK
ISEQTFKNWQ QKAQGIALSL PELNPYIPDD FSLVKNDKNY VRPELIVDKA DLRVVYAPSR
YFASEPKADV SVVLRNPQAM DSARNQVLFA LNDYLAGMAL DQLSNQAAVG GISFSTNANN
GLMVTANGYT QRLPQLFLAL LEGYFSYDAT EEQLAQAKSW YTQMMDSAEK GKAYEQAIMP
VQMISQVPYF SRDERRALLP SITLKEVMAY RNALKTGVRP EFLVIGNMSE AQATSLAQDV
QKQLAANGSA WCRNKDVVVE KKQSVIFEKA GSSTDSALAA VFVPVGYDEY VSAAYSAMLG
QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSYLWQ RYQAFFPDAE
AKLRAMKPEE FAQIQQAIIT QMRQAPQTLG EEASRLSKDF DRGNMRFDSR DKIIAQIKLL
TPQKLADFFH QAVVEPQGMA ILSQIAGSQN GKAEYVHPTG WKVWDNVSAL QQTLPLMSEK
NE