Gene SeD_A1370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1370 
Symbol 
ID6875455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1345511 
End bp1347562 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content51% 
IMG OID642784538 
Productprotease 2 
Protein accessionYP_002215208 
Protein GI198245473 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.364308 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00000000160982 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGCCAA AAGCCAATCG AATTCCCTAT GCCATGACCG TACATGGCGA TACGCGCATT 
GATAATTATT ACTGGCTGCG AGATGACACT CGCTCGCAGC CGGAAGTCCT TGATTACCTG
CATCAGGAAA ATGAGTATGG CCGGAAGGTC ATGACCTCTC AGCAGGCGTT ACAGGACCGC
ATTCTAAAAG AAATTATCGA TCGCATCCCG CCCAGAGAAG TTTCCGCTCC GTATGTGAAA
AATGGCTATC GCTACCGTTA TATCTATGAA CCCGGCTGCG AATATGCCAT CTATCAACGA
CAATCGGCGT TAAGCGAAGA GTGGGATGTG TGGGAAACCT TGCTCGATGC GAACCAGCGG
GCCGCGCACA GCGAATTTTA TACGCTCGGC GGACTTGCCA TTACGCCGGA TAATACCATC
ATGGCGCTGG CAGAAGATTA TTTATCCCGT CGTCAGTATG GGTTGCGTTT TCGTAACCTC
GAAAGCGGTA ACTGGTATCC GGAACTGCTG GATAACGTTG CGCCTGAATT TGTCTGGGCC
AATGATTCCC TGACCCTTTA CTATGTGCGT AAGCATAAGA AGACGCTGCT GCCCTATCAG
GTTTGGCGGC ACACGATTGG CACTCCGTCA TCGCAAGATG AACTGGTATA TGAAGAGAAA
GACGACACCT TTTATGTCAG CCTGCATAAA ACCACTTCGC AGCATTATGT GGTAATTCAT
CTTGCCAGCG CCACCACTAG CGAAGTGCTA TTACTTGACG CGGAACTGGC CGATGCCGAG
CCGTTTTCAT TCTTACCGCG CCGCAAAGAC CACGAATATA GTCTCGATCA CTATCAACAT
AAGTTTTACC TGCGCTCTAA CCGGAACGGT AAAAACTTTG GGTTGTACCG TACCCGCGTG
CGCAATGAAA ACGCCTGGGA AGAGCTGATC CCTCCGCGCG AGCATATTAT GCTGGAAGGG
TTTACCCTGT TTACCGACTG GTTAGTGGTC GAAGAGCGTC AACGGGGGCT TACCAGCCTG
CGGCAAATTA ACCGTAAAAC CCGTGAAGTG ATAGGCATTG CCTTTGACGA TCCGGCTTAC
GTGACGTGGC TTGCCTATAA TCCCGAACCT GAGACCTCCC GGCTGCGTTA CGGCTATTCT
TCAATGACGA CGCCAGATAC CTTGTTTGAA CTGGATATGG ATACCGGAGA ACGACGGGTA
CTTAAACAGA CGGAAGTGCC TGGGTTTGAT TCTGGCTGTT ATCAGAGCGA ACACCTGTGG
ATCACCGCGC GCGACGGCGT CGAAGTGCCG GTATCGCTGG TTTATCATCA GAAGTATTTT
CGTAAAGGGC AAAATCCGCT TCTGGTTTAC GGCTACGGAT CTTACGGTTC CAGTATTGAC
GCCGACTTCA GCAGCAGCCG ACTGAGCTTG CTGGATCGTG GCTTTGTTTA CGCAATCGTA
CACGTTCGCG GCGGCGGTGA GCTGGGGCAG CAGTGGTATG AAGATGGCAA ATTCCTCAAA
AAGCGGAATA CTTTTAATGA CTATCTTGAT GCCTGCGATG CCTTATTAAA ACTGGGTTAC
GGTTCGCCGT CGCTGTGTTA CGGGATGGGC GGGAGCGCGG GCGGAATGCT AATGGGCGTC
GCTATCAACG AACGCCCCGA GCTTTTCCAC GGCGTTATTG CCCAGGTACC CTTTGTTGAT
GTATTAACCA CGATGCTGGA TGAGTCGATC CCACTAACGA CAGGAGAGTT TGAAGAGTGG
GGGAACCCGC AGGATATTGA GTATTATGAC TATATGAAAA GCTATAGTCC TTATGACAAT
GTCAAAGCGC AGGACTATCC GCACCTGCTG GTGACGACAG GATTGCACGA TTCCCAGGTG
CAATACTGGG AACCTGCGAA GTGGGTGGCA AAATTACGCG AGCTAAAAAC GGACCAACGT
CTGCTGCTGT TATGTACGGA TATGGACTCC GGGCACGGTG GTAAGTCGGG GCGGTTTAAA
TCCTACGAAG GCGTCGCGCT GGAGTTCGCC TTTTTAATCG GCCTGGCGCA GGGAACCTTA
CATAGCGCAT AG
 
Protein sequence
MLPKANRIPY AMTVHGDTRI DNYYWLRDDT RSQPEVLDYL HQENEYGRKV MTSQQALQDR 
ILKEIIDRIP PREVSAPYVK NGYRYRYIYE PGCEYAIYQR QSALSEEWDV WETLLDANQR
AAHSEFYTLG GLAITPDNTI MALAEDYLSR RQYGLRFRNL ESGNWYPELL DNVAPEFVWA
NDSLTLYYVR KHKKTLLPYQ VWRHTIGTPS SQDELVYEEK DDTFYVSLHK TTSQHYVVIH
LASATTSEVL LLDAELADAE PFSFLPRRKD HEYSLDHYQH KFYLRSNRNG KNFGLYRTRV
RNENAWEELI PPREHIMLEG FTLFTDWLVV EERQRGLTSL RQINRKTREV IGIAFDDPAY
VTWLAYNPEP ETSRLRYGYS SMTTPDTLFE LDMDTGERRV LKQTEVPGFD SGCYQSEHLW
ITARDGVEVP VSLVYHQKYF RKGQNPLLVY GYGSYGSSID ADFSSSRLSL LDRGFVYAIV
HVRGGGELGQ QWYEDGKFLK KRNTFNDYLD ACDALLKLGY GSPSLCYGMG GSAGGMLMGV
AINERPELFH GVIAQVPFVD VLTTMLDESI PLTTGEFEEW GNPQDIEYYD YMKSYSPYDN
VKAQDYPHLL VTTGLHDSQV QYWEPAKWVA KLRELKTDQR LLLLCTDMDS GHGGKSGRFK
SYEGVALEFA FLIGLAQGTL HSA