Gene SNSL254_A2038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2038 
Symbol 
ID6483066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1977428 
End bp1979479 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content51% 
IMG OID642737394 
Productprotease 2 
Protein accessionYP_002041144 
Protein GI194446810 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.387089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.101545 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCCAA AAGCCAATCG AATTCCCTAT GCCATGACCG TACATGGCGA TACGCGCATT 
GATAATTATT ACTGGCTGCG AGATGACACT CGCTCGCAGC CGGAAGTCCT TGATTACCTG
CATCAGGAAA ATGAGTATGG CCGGAAGGTC ATGTCCTCTC AGCAGGCGTT ACAGGACCGC
ATTCTAAAAG AAATTATCGA TCGTATCCCG CCCAGAGAAG TTTCCGCTCC GTATGTGAAA
AACGGCTATC GCTACCGTTA TATCTATGAA CCCGGCTGCG AATATGCCAT CTATCAACGA
CAATCGGCGT TAAGCGAAGA GTGGAATGTG TGGGAAACCT TGCTCGATGC GAACCAGCGG
GCCGCGCACA GCGAATTTTA TACGCTCGGC GGGCTTGCCA TTACGCCGGA TAATACCATC
ATGGCGCTGG CAGAAGATTA TTTATCCCGT CGTCAGTATG GGTTGCGTTT TCGTAACCTC
GAAAGCGGTA ACTGGTATCC GGAACTTCTG GATAACGTTG CGCCTGAATT TGTCTGGGCC
AATGATTCCC TGACCCTTTA CTATGTGCGT AAGCATAAGA AGACGCTGCT GCCCTATCAG
GTTTGGCGGC ACACGATTGG CACTCCGTCA TCGCAAGATG AACTGGTATA TGAGGAGAAA
GACGATACCT TTTATGTCAG CCTGCATAAA ACCACTTCGC AGCATTATGT GGTGATTCAT
CTTGCCAGCG CCACCACTAG CGAAGTGCTA TTACTTGACG CGGAACTGGC CGATGCCGAG
CCGTTTTCAT TTTTACCGCG CCGCAAAGAC CATGAATATA GTCTCGATCA CTATCAACAT
AAGTTTTACC TGCGCTCTAA CCGGAACGGT AAAAACTTTG GGTTGTACCG TACCCGCGTG
CGCAATGAAA ACGCCTGGGA AGAGCTGATC CCTCCGCGCG AGCATATTAT GCTGGAAGGG
TTTACCCTGT TTACCGACTG GTTAGTGGTC GAAGAGCGTC AACGGGGGCT TACCAGCCTG
CGGCAAATTA ACCGTAAAAC CCGTGAAGTG ATAGGCATTG CCTTTGACGA TCCGGCTTAC
GTGACGTGGC TTGCCTATAA TCCCGAACCT GAGACCTCCC GGCTGCGTTA CGGCTATTCT
TCAATGACGA CGCCAGATAC CTTGTTTGAA CTGGATATGG ATACCGGAGA ACGACGGGTA
CTTAAACAGA CGGAAGTGCC TGGGTTTGAT TCTGGCTGTT ATCAGAGCGA ACACCTGTGG
ATCACCGCAC GCGACGGCGT CGAAGTGCCG GTATCGTTGG TTTATCATCA GAAGTATTTT
CGTAAAGGGC AAAATCCGCT TCTGGTTTAC GGCTACGGAT CTTACGGTTC CAGTATTGAC
GCCGACTTCA GCAGCAGCCG ACTGAGCTTG CTGGATCGTG GCTTTGTTTA CGCAATCGTA
CACGTTCGCG GCGGCGGTGA GCTGGGGCAG CAGTGGTATG AAGATGGCAA ATTCCTCAAA
AAGCGGAATA CTTTTAATGA CTATCTTGAT GCCTGCGATG CCTTATTAAA ACTGGGTTAC
GGTTCGCCGT CGCTGTGTTA TGGGATGGGC GGGAGCGCGG GCGGAATGCT AATGGGCGTC
GCTATCAACG AACGTCCCGA GCTTTTTCAC GGCGTTATTG CCCAGGTACC CTTTGTTGAT
GTATTAACCA CGATGCTGGA TGAGTCGATC CCACTAACGA CAGGAGAGTT TGAAGAGTGG
GGGAACCCGC AGGATATTGA GTATTATGAC TATATGAAAA GCTATAGTCC TTATGACAAT
GTCAAAGCGC AGGACTATCC GCACCTGCTG GTGACGACAG GATTGCACGA TTCCCAAGTG
CAATACTGGG AACCGGCGAA GTGGGTGGCA AAATTACGCG AGCTAAAAAC GGACCAACGT
CTGCTGCTGC TATGTACGGA TATGGACTCC GGGCACGGTG GTAAGTCGGG GCGGTTTAAA
TCCTACGAAG GCGTCGCGCT GGAGTTCGCC TTTTTAATCG GCCTGGCGCA GGGAACCTTA
CATAGCGCAT AG
 
Protein sequence
MLPKANRIPY AMTVHGDTRI DNYYWLRDDT RSQPEVLDYL HQENEYGRKV MSSQQALQDR 
ILKEIIDRIP PREVSAPYVK NGYRYRYIYE PGCEYAIYQR QSALSEEWNV WETLLDANQR
AAHSEFYTLG GLAITPDNTI MALAEDYLSR RQYGLRFRNL ESGNWYPELL DNVAPEFVWA
NDSLTLYYVR KHKKTLLPYQ VWRHTIGTPS SQDELVYEEK DDTFYVSLHK TTSQHYVVIH
LASATTSEVL LLDAELADAE PFSFLPRRKD HEYSLDHYQH KFYLRSNRNG KNFGLYRTRV
RNENAWEELI PPREHIMLEG FTLFTDWLVV EERQRGLTSL RQINRKTREV IGIAFDDPAY
VTWLAYNPEP ETSRLRYGYS SMTTPDTLFE LDMDTGERRV LKQTEVPGFD SGCYQSEHLW
ITARDGVEVP VSLVYHQKYF RKGQNPLLVY GYGSYGSSID ADFSSSRLSL LDRGFVYAIV
HVRGGGELGQ QWYEDGKFLK KRNTFNDYLD ACDALLKLGY GSPSLCYGMG GSAGGMLMGV
AINERPELFH GVIAQVPFVD VLTTMLDESI PLTTGEFEEW GNPQDIEYYD YMKSYSPYDN
VKAQDYPHLL VTTGLHDSQV QYWEPAKWVA KLRELKTDQR LLLLCTDMDS GHGGKSGRFK
SYEGVALEFA FLIGLAQGTL HSA