Gene SeSA_A2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A2033 
Symbol 
ID6517208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1950368 
End bp1952419 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content51% 
IMG OID642747113 
Productprotease 2 
Protein accessionYP_002114914 
Protein GI194735292 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.147358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.976007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCCAA AAGCCAATCG AATTCCCTAT GCCATGACCG TACATGGCGA TACGCGCATT 
GATAATTATT ACTGGCTGCG AGATGACACT CGCTCGCAGC CGGAAGTCCT TGATTACCTG
CATCAGGAAA ATGAGTATGG CCGGAAGGTC ATGTCCTCTC AGCAGGCGTT ACAGGACCGC
ATTCTAAAAG AAATTATCGA TCGTATCCCG CCCAGAGAAG TTTCCGCTCC GTATGTGAAA
AACGGCTATC GCTACCGTTA TATCTATGAA CCCGGCTGCG AATATGCCAT CTATCAACGA
CAATCGGCGT TAAGCGAAGA GTGGGATGTG TGGGAAACCT TGCTCGATGC GAACCAGCGG
GCCGCGCACA GCGAATTTTA TACGCTCGGT GGACTTGCCA TTACGCCGGA TAATACCATT
ATGGCGCTGG CAGAAGATTA TTTATCCCGT CGTCAGTATG GGTTGCGTTT TCGTAACCTC
GAAAGCGGTA ACTGGTATCC GGAACTGCTG GATAACGTTG CGCCTGAATT TGTCTGGGCC
AATGATTCCC TGACCCTTTA CTATGTGCGT AAGCATAAGA AGACGCTGCT GCCCTATCAG
GTTTGGCGGC ACACGATTGG CACTCCGTCA TCGCAAGATG AACTGGTATA TGAAGAGAAA
GACGATACCT TTTATGTCAG CCTGCATAAA ACCACTTCGC AGCATTATGT GGTGATTCAT
CTTGCCAGCG CCACCACTAG CGAAGTGCTG TTACTTGACG CGGAACTGGC CGATGCCGAG
CCGTTTTCAT TCTTACCGCG CCGCAAAGAC CACGAATATA GTCTCGATCA CTATCAACAT
AAGTTTTACC TGCGTTCTAA CCGGAACGGT AAAAACTTTG GGTTGTACCG TACCCGCGTG
CGCAATGAAA ACGCCTGGGA AGAGCTGATC CCTCCGCGCG AGCATATTAT GCTGGAAGGG
TTTACCCTGT TTACCGACTG GTTAGTGGTC GAAGAGCGTC AACGGGGGCT TACCAGCCTG
CGGCAAATTA ACCGTAAAAC CCGTGAAGTG ATAGGCATTG CCTTTGACGA TCCGGCTTAC
GTGACGTGGC TTGCCTATAA TCCCGAACCT GAGACCTCCC GGCTGCGTTA CGGCTATTCT
TCAATGACGA CGCCAGATAC CTTGTTTGAA CTGGATATGG ATACCGGAGA ACGACGGGTA
CTTAAACAGA CGGAAGTGCC TGGGTTTGAT TCTGGCTGTT ATCAGAGCGA ACACCTGTGG
ATCACCGCGC GCGACGGCGT CGAAGTGCCG GTATCGCTGG TTTATCATCA GAAGTATTTT
CGTAAAGGGC AAAATCCGCT TCTGGTTTAC GGCTACGGAT CTTACGGTTC CAGTATTGAC
GCCGACTTCA GCAGCAGCCG ACTGAGCTTG CTGGATCGTG GCTTTGTTTA CGCAATCGTA
CACGTTCGCG GCGGCGGTGA GCTGGGGCAG CAGTGGTATG AAGATGGCAA GTTCCTCAAA
AAGCGGAATA CTTTTAATGA TTATCTTGAT GCCTGCGATG CCTTATTAAA ACTGGGTTAC
GGTTCGCCGT CGCTGTGTTA CGGGATGGGC GGGAGCGCGG GCGGAATGTT AATGGGCGTC
GCTATCAACG AACGCCCCGA GCTTTTCCAC GGCGTTATTG CCCAGGTACC CTTTGTTGAT
GTATTAACCA CGATGCTGGA TGAGTCGATC CCACTAACGA CAGGAGAGTT TGAAGAGTGG
GGGAACCCGC AGGATATTGA GTATTATGAC TATATGAAAA GCTATAGTCC TTATGACAAT
GTCAAAGCGC AGGACTATCC GCACCTGCTG GTGACGACAG GATTGCACGA TTCCCAGGTG
CAATACTGGG AACCGGCGAA GTGGGTGGCA AAATTACGCG AGCTAAAAAC GGACCAACGT
CTGCTGCTGT TATGTACGGA TATGGACTCC GGGCACGGTG GTAAGTCGGG GCGGTTTAAA
TCCTACGAAG GCGTCGCGCT GGAGTTCGCC TTTTTAATCG GCCTGGCGCA GGGAACCTTA
CATAGCGCAT AG
 
Protein sequence
MLPKANRIPY AMTVHGDTRI DNYYWLRDDT RSQPEVLDYL HQENEYGRKV MSSQQALQDR 
ILKEIIDRIP PREVSAPYVK NGYRYRYIYE PGCEYAIYQR QSALSEEWDV WETLLDANQR
AAHSEFYTLG GLAITPDNTI MALAEDYLSR RQYGLRFRNL ESGNWYPELL DNVAPEFVWA
NDSLTLYYVR KHKKTLLPYQ VWRHTIGTPS SQDELVYEEK DDTFYVSLHK TTSQHYVVIH
LASATTSEVL LLDAELADAE PFSFLPRRKD HEYSLDHYQH KFYLRSNRNG KNFGLYRTRV
RNENAWEELI PPREHIMLEG FTLFTDWLVV EERQRGLTSL RQINRKTREV IGIAFDDPAY
VTWLAYNPEP ETSRLRYGYS SMTTPDTLFE LDMDTGERRV LKQTEVPGFD SGCYQSEHLW
ITARDGVEVP VSLVYHQKYF RKGQNPLLVY GYGSYGSSID ADFSSSRLSL LDRGFVYAIV
HVRGGGELGQ QWYEDGKFLK KRNTFNDYLD ACDALLKLGY GSPSLCYGMG GSAGGMLMGV
AINERPELFH GVIAQVPFVD VLTTMLDESI PLTTGEFEEW GNPQDIEYYD YMKSYSPYDN
VKAQDYPHLL VTTGLHDSQV QYWEPAKWVA KLRELKTDQR LLLLCTDMDS GHGGKSGRFK
SYEGVALEFA FLIGLAQGTL HSA